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METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS 
^YND^flETHODS OF SCREENING FOR MODULATORS OF BREAST 

CANCER 



5 CROSS-REFERENCES TO RELATED APPLICATIONS 

ThisappUcadoac^P^^USSN^^ffled,^^; 

60/282 698 med April 9, 2001; USSN 6(V288,590, fiMMay 4, 2001; andUSSN 
entirety. 

FIELD OF THE INVENTION 
The invention relates to the idenfificarion of nocleic acid and protein 
1 5 expression profiles and nncleic adds, prodnota, and antibodies thereto «ha, are involved tn 

ptosis and therapy of breast canoer. The invention fiarther relates to na.fi.ods for 
identifying and using agents and/or targets that inhibi.broaat cancer. 

BACKGROUND OF THE INVENTION 
20 B^tcanccison.of.hemos.^nenfiydiag.aoseficanccraand.hesecnnd 

teadin* canse of fern* cancer dead, in North America and northern Enrop., with lung 
" LpL aensifivity has heen eaUnta.od at only ~ of breast 
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hormone therapy, and/or chemotherapy. Althoughmany breast cancer patients are effectively 
treated, the current therapies can all induce serious side effects which diminish quality of hfe. 
Deciding on a particular course of treatment is typically based on a variety of prognostic 
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978; 
Hamilton and Piccart, 2000, Ann. Oncol. 11:647-663), including genetic predispostion 
markersBRCA-1 and BRCA-2 (Rohson, 2000, J. Clin. Oncol. 18:113sup-118sup). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
important prognostic factor; five year survival rates drop from 80 percent in patients with no 
lymph node metastases to 45 to 50 percent in those patients who do have lymph node 
metastases. A recent report showed that micrometastases can be detected from lymph nodes 
using reverse transcriptase-PCR methods based on the presence ofmRNA for 
carcinoembryonic antigen, which has previously been shown to be present in the vast 
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med. 

339(4):223 (1998). 

The identification of novel therapeutic targets and diagnostic markers is 
essential for improving the current treatment of breast cancer patients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 
suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(HerceptinO/trastuzumab) are currently in use for the treatment of metastatic breast cancer 
(Ross and Fletcher, 1998, Stem CeUs 16:413-428). Similarly, anti-CD20 monoclonal 
antibodies (Rituxm®/rituxhnab) are used to effectively treat non-Hodgekin' s lymphoma 
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol. 
10:548-551). 

Other potential immunotherapeutic targets have been identified for breast 
, cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane 
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15 



20 



proton, present tire* ^ ^ „ m 

tumor marker, CA 15-3 (Bon et ai., w, w that other targets 

h,v.^ors«^^HER2 M rr^4 ; U.e»fb W .. 1 sol«Wote^ 

teport ed,obeoverex P ressedinbreas< cancer, — EQFR( ™"t 
1( Ul3 9 S-1402), c-erbB3 (Naidu e, aa.,.988, B, , Cancer 

(Lul.-Llorca - * «». '■ °~ 6I:17M76) ' PKW ^7n ■» ; 202 2,2, 
Lancer Res. 20:2255-2264), MTA1 (Nawa e, a.., 2000, Cel. ^ ***** 
breastcancer associa.ec gene 1 (Kurt * .... 2000, Breast Cancer Re, Treat 5MM» 

noneareomenUyapprovedforbreaetcancertherapynifheUS. 

Disclosures of certain genes end ESTs described as being expressed m breaat 
cancer are fonnd in in«ema,iona. patent applications WO-99,338o9, WO-97/25426, WO 
,7/02280 andWO-00V55n3.WO.98/45328 and WO-«,/22130. Snnilarly, genes and KTs 

5,693,522. The ntthty of sncb genes is described in each of these pubticattons, and then 
disclosures are incorporated herein in their entirety. 

While industry and academia have identified novel sequences, there has not 
been an equal effort exerted to identify the function of th^e novel aequettcas. The 

^entioninbreastandothercancers. Additional provided hereto are ntedtods Or , can 
^nsedtodiagnosiaandprognosiaofhreaa. cancer. Further provided axe metitodstha. can 
30 be used to screen candidate bioactive agents for tire ability to modulate breast cancer. 
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SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
areup-anddown-regulatedinbreastcancercell, Such genes are useful for diagnostic 

purposes.andalsoasta^^ t 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the biological sample is a tissue sample. In another 
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat 
breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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U one embodiment, the method tamer comprises the step of ^amplifying 

re«cac y o ( «h.«h^y. b .^^^^^ teI ^ , ^ MnCer - 
0 loaflrterembodimeot.meparieo.hasadrusres^.fonnofbreas.canoer. 

to one embodiment, the method former comprises the step of: (»,) oomparmg 
to level of the breast eaneer-aasociated transcript to a level of me breast cancer-assocated 
transcript in a biological sample from the patient prior to, or earlier in, the therapcuUe 

treatment. „ 
15 Additionally, provided herein is a method of evaluating the effect of a 

candidate breast cancer drug comprising administering the drug to a patient and removing a 
cell sample from the patient. The expression profile of the ceU is then determined. Tms 
method may further comprise comparing the expression profile to an expression profile of a 
healthy individual. In a preferred embodiment, said expression profile includes a gene of 

20 Tables 1-25. . , 

In one aspect, the present invention provides an isolated nucleic acid molecule 

consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

25 aCld ' m one aspect, the present invention provides an isolated polypeptide which is 

encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-25. 
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In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 
5 In one aspect, the present invention provides a method of detecting a breast 

cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
1 0 biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
1 5 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 
20 In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 

cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
25 proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a v 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 
In another aspect, the present invention provides a drug screening assay 
30 comprising the steps of: (i) administering a test compound to a mammal having breast cancer 
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as shown in Tables 1-25 in a treated cell or mammal wth the level of ge P 

taone.mbodimeat.meoon^lUamammalwiU.breastosncerora^ 
the control is a normal cell or mammal. 

concen^ons. *^<+~*"'^* ,i ^* 1 *?JZ 

*" ^ U one embodiment, the leva, of a pluratity of po,ynucleotides ft* se.eotive.y 
hybridize to a sequence a, .east 80% identical to a sequence as shown in Tables 1-25 are 
individually compared to their respective levels in a control oell sample or mamma!. In 
preferred embodiment the plurality of polynucleotides is from three lo ten. 

I„ another aspect, the present invention provides a method for treating a 

described herein. . 

In another aspeet, the present invention provides a pharmaoeuttoal 
composition for treating a mammal having breast cancer, tire composition comprtstng a 

excipient 

In one aspect, the present invention provides a method of screenmg drug 

5 - ^l, P ^.--^•«-« h ^ -ta,,H ^;!■' ta,,, 

can c«. moneembodime»«,ageneis selee.eofromTabtesl-25. The method htrrner 

duties ^.dn,. — «-*- 0, * ,dng " ,,W * 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
comparing the level of expression in the absence of the drug candidate to the level of 



30 
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• • the 0 resenceofthedmgcandidate, W h e r e intheco n oentratiouofthedrug 
expression in the presence 01 mcui 5 addition or 

nrotein for example as a result of a gene knockout.. 

' MLver.provi^he.inisabiocMpoo^singoneormorcnuc.e.cac, 
— ofTab.es 1-25, wherein ft. bio* comprises fewer to .000 nuc.eic »d probes. 

is provided. The medrod comprises determining the expression of a gene of Tab.es 1-25 

individual orasecondnnaffeoted individual. Adifference in Ore expression mdrcates to .he 
first individual has a disorder associated with breast cancer. 

to a further embodiment, me biochip also includes a polynucleotide sequence 
20 ^fagenethatUnotup-anddown-regnlatedinbreasteancer. 

to one embodiment a method fo, screening for a bioactive agent capable of 

^^.n^u^f^an^WywMchbindsmsddb^tc^mCduU^ 
protein or fragment thereof. In a prefer^ embodhnen, the method comprisea combmmg a 

antibody which binds to said breast cancer modulatory protein or flagmen, thereof. The 
mel hod further inCudes delermming Ore binding of said breast cancer mo4ula.ory proton or 
fegmentmereofandsaidanttbo^y. Wherein toe is a change in binding, an agent ts 
i^edaaanmtrf^ag^Ttai^emgagemcanbe^agomatoranan^ 

30 Preferably, Ore agent inhibite breast cancer. 
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Also provided herein are methods of eliciting an immune response lit an 

of Tables 1-25.^ ^ ^ ^ rf ^ mbsmm 

prefer*., of Table 25, or a fragment .hereof, and a pharmaceutic^ accep.able earner. In 
Lmer emhodhnen., said composition comprises a nuc.eio acid comprising a se,uence 

Tables 1-25, and a pharmaceutical^ acceptable carrier. 

Also provided are memoes of neutralizing me At of a breast cancer protem, 
oranagmen.merentoomprismgeonUcHnganagen.speoincforaaidpm.emwimsard 

protein in an amount sufflcient to effect neunahzadon. In another embodiment, me protem . 
encodedbyanucleicacidselectedfromthose.ofTablesl-25. 

in another aspect of the invenSon, a method of treating an individual for breast 
„ is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 
Method comprises administering to a patient having breast cancer an antibody to a breast 
cteer modulating protein conjugated to a merapeutiomoiery. Such a merapendomoie^ can 
be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the 
, nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer 
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„ o n m 11 10 n 18 19 20 21, and 22 list those genes that are 

r^rr^rr::; r - :r - 

that is part of the unigene cluster. 

The term "breast cancer protein" or "breast cancer polynucleotide" or "breast 

10 cancer-associated^ 

alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence fat has 
greater than about 60% nucleotide sequence identity, 65%, 70»/o, 75% 80*85*90* 
preferably 91%, 92%, 93%, 94%, 95%, 96%, 91%, 98% or 99% or greater nucleot.de 
sequence identity, preferably over a region of over a region of at least about 25, 50, 100 200, 

15 500 lOOO,ormorenucleotides,toanucleotidesequenceoforassociatedwithageneof 

Tabies 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 
comprising an amino acid sequence encoded by a nucleotide sequence of or associated wrth a 
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybndxze 
under stringent hybridization conditions to a nucleic acid sequence, or the complement 
20 thereof of Tables 1-25 and conservatively modified variants thereof or (4) have an ammo add 
sequence that has greater than about 60% amino acid sequence identity, 65%, 10%, 75%, 
80% 85%, 90o/o, preferably 91%, 92%, 93%, 94* 95%, 96%, 97%, 98% or 99% or greater 
amino sequence identity, preferably over a region of over a region of at least about 25, 50 
100 200 500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide 
25 seq uence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster, cow, pig, horse, sheep, or other mammal. A "breast cancer 
polypeptide" and a "breast cancer polynucleotide," include both naturally occurnng or 
recombinant forms. 
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A fulllengtn ,, h „„f mat contains all of the elements 

polynucleotide or polypeptide sequences. The MUengtb my v 

oiolopcal sample is typically obtained front a erotic organism, most preferably a 
JL such as a prince e.g„ chimpanzee ot homan; cow; dor, oaf, a rode* e.g., ganrea 

15 pit, rat, moose; rabbit; or a bird; reptile; or fish. 

.p^idingabiologtcalsanrple-meanstoobUinabiologi^lsamplefornsem 

isol a«ed by softer pemon, a. anomer time, and/or for another pnrpose), or by perfonmng the 
» methodsoffceinvendoninvivo. Arohivaln S snes,ha™tgheahne„«oron to o m eln Sto ry,w,U 

be particularly useful. 

The temts "identical" o, peroent "identity," in the context of two or more 

arethe same or have a specified percentage of amino acid redoes or nucleotides that arethe 

95% 96% 97%, 98%, 99%, or higher idendlyova: a specified region, W hen compared .mi 

mMso ^ l ^nBI^orBI^T2.0^e n coc«mpa*ndgon l bm S w,md rf « 

pammeters described below, or by mannal alignment and visnal inspection (see, , g ., NCB 
30 LhaitehW^ww.nchi.nhn.nih.gov/BLAST/orttrehlre). Snch sevens are then sa,d to 
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50-100 amino acids, or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which testsequencesarecompared. When using a sequence comparison algonthm, .est 
10 andreferencesequenc.sareenteredmtoacomputer.subsequence^ 

program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 
, 5 A "comparison window", as used herein, includes reference to a segment of 

one ofthe number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in winch 
a sequence may be compared to a reference sequence ofthe same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
20 forcomparisonarewell-knownintheart. Optimal alignment of sequences for companson 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, ^v. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J, Mol 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. 
' Acad. Sci. USA 85 :2444 (1988), by computerized implementations of these algorithms 
25 (GAP BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspections eg., Current Protocols in Molecular Biology (Ausubel et a/.,eds. 

1995 supplement)). 

Preferred examples of algorithms that are suitable for detenmning percent 
30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 
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al. J. Uol Biol. ^ v proteins of 

toinvontion.Softwar.forp.rfcmu.gBLASTanalyses^pubUoy 

algorithm involves first identifying high scoring sequence pans(HSPs) by V g 

cumulative alignment score can bo inched. Cumulafive scores ere calculated ustng, e.g., 
Z^teofideZence,, *e parameters M (reward score forap,rof matebmgrestdues; 
^ays>0)andN<pena.tyseorefor m ia m a.c»ingresidues;alw»ys<0). For anuno ac.d 

hits in each direction are halted when: the cumulafive alignment score falls offby fire 
quantity X from maximum achieved value; me cumulafive score goes to zero or below 

eithersequenceisreached. The BLAST algorithm parameteis W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
us esa S defa»ltsawordlengm(W)oni,anexpee b fio»(E)of.0,M-5,N=4anda 

comparison of both sttands. For amino acid sequences, the BLASTP program uses as 
JL a wordlength of 3, and expectation (E) of R and the BLOSUMo* scormg n^x 
( ^Hem k orf & Hen i xo £f ,^ N n^cn <i&i .^ 8 9 : 10915(1989))altgnmen B (B)of 

50,expectafion(E)ofl0,M-5,N-l,andaoom P ari S onofbott. S trands. 

The BLAST algorithm also performs a statistical analysts of the sumlanty 
W^t^W^ScAM^.rroc.MLAeUSci. USA 90:5873- 

5787 (.993)) One measure of similarity provided by the BLAST algorithm ts the smallest 
.umptobabt^^wlnohprovideaanmdiotfonofmepmbabtU^bywhtchamateh 

30 ^oen^onucleottdeoraminoacidsequ^wouldocourbyehance. Forexampl.,a 
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nucleicacidis considered ^"^^""^Tf^Z 

belargenegativenumbers.e.g.,5,10,20,30,40, 40,70,90,110,150,170^1. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is 
.eactivewiththeantibodiesraisedagdnstthepolypeptide 

acid asdescribedbelow. Thus, a polypeptide is typically substantially identical to a second 
polypeptide,^., where the two peptides differ only by conservative substitutions. Another 
LLLthattwonucleicacidsequencesaresubst^^ 

or their complements hybridize to each other under stringent conditions, as descnbed below. 
Yet another indication that two nucleic acid sequences are substantiaUy identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring ceU or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

. The terms "isolated," "purified;- or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 
- preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 



30 
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removing at least one contaminant from the composition to be purified. In this s^nse, 
purification does notrequire that the purified compound be homogenous e.g., 10 /o pm. 

The terms "polypeptide," "peptide" and "protein" are used mterchangeably 
.ereintorefertoapolymerofamh.oaoidresidue, ^ — ^^^Z 
5 raw hichoneormoreammoac^^ 

naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic ammo acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code as well as those amino acids that are later modified, e.g., hydroxyprohne, y- 
carboxyglutan^ 

the same basic chemical structure as anaturally occurring amino acid, e. g ., an a earbonthat is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserme, 

15 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring ammo acid. 

20 Amino acids may be referred to herein by either their commonly known three 

iettersvmbolsorbymeon^ . 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 

accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
25 acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
30 m ostproteins. Forinst^ 
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polypeptide aho describes ailent variations of thenuclerccrd. One of stall wrllr gma 
Xlonfor— e.andTGG.^chisotdinari.ytheoniyeodoofor.vp^cM 
bldifiedlo yield a— y identic,. mo.ecu,e. Accordingly, often silep. vattaUoo, 

d.eexpreasiopproduct.butpotwithreapeottoactpalprobeaequences. 

As to amino acid sequences, ope of skill will recognize that mdtvtdual 
substitutions, deletions ot additions to a ppoleie aeid, peptide, polypeptide, or protem 
sequence which alters, adds or deletes a single amino acid or a smaU percentage of annno 
acids in the encoded sentence is a ••conservatively modified variant" where the alteration 
results ip (he substitution of an amino acid with a chemically similar ppppp actd. 
Conservative substitution Uh.es providing fimctinnafiy similar amino acids rue well known tn 
the ar, Such conservatively modified variants are in addition to and do pot exch.de 
polymorphic variants, interspecies homologs, and alleles of the mvention.typrca.ly 

Glutamic acid (E); 3) Asparagine (N), Glntamine (Q); 4) Arginipe <R), Lysme (K); 5) 
Meucmeffl.UucinenMemiottine^^ 

Tryptophan (W); 7) Serine (S), Threonine CD; and 8) Cysteine (C), Methiomne (M) (see, 
e.g., Creighton,/'ro<eins(1984)). 

Macmmolecular structures such as polypeptide stiuctures can be descnbed r» 
terms of various levels of organization. For a genera, discussion of mis organization, see 
ee . Albert, e , a ,.,MolecularBiolo g yo/ l l,eCen<?^., ,994) and Cantor & Schimmel, 
BtopHysiC CHen.is.ry Par, I: The Confer**™ of Biological Uacronolecules : (1980). 
"Primary structure" refers to tire amino acid sequence of a particular peptide. Secondary 
structure" refers to locally ordered, three dimensional structure* wfthta a polypeptide. These 
, stiuctures are commonly known as domains. Domains are portions ofa polypeptide mat 
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often form a compact unit of the polypeptide and are typically 25 to approximately 500 

amino acids long. Typical domains axe made up of sections of lesser organization such as 

stretches of (3-sheet and ohelices. 'Tertiary structure" refers to the complete three 

dimensional stmcture of a polypeptide mo™ 

dimensional stmctuxe formed, usua^ 

units Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "ohgonucleotide" or "polynucleotide" or grammatical 
equivalents used herein means at least two nucleotides covalently linked together. 
OUgonucleotides axe typically from about 5, 6,7,8,9,10, 12, 15,25, 30, 40, 50 or more 
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 
polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 10,000, etc. Anucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have alternate backbones, comprising, e.g., phosphoramidate, 
phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 
Cook eds Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, mcluding, for 
example, phosphoramidate (Beaucage et aL, Tetrahedron 49(10):1925 (1993) and references 
therein- Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et aL, Eur. J. Biochem. 81:579 
(1977); Letsinger et aL, Nucl. Acids Res. 14:3487 (1986); Sawai et al, Chem. Lett. 805 
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(19 S4), Utshtge. - a,., , Am. Cheru. Soc. .10:4470 (.983)-, andPauwefc , 
^26:14. 9.986)), phosphorottoate^^^ 

aoiU .S.Pa K n l No.5, 6 44,04 8 ),phosphorodiUuoa, e (Briue. ^I^O-lS-UUJJH 
(,989) 0-memylphophoro»rnidue Unkages (s« Eckstein, Oligonucleotides and Analogues. 

bokagea (sea Eghokn, J. Am. Cham Soc. 114,895 (.992); Meier e. al, Chem. fit . 
3.-.W8 (1992); Nielsen, Nature, 365:566 (.993); Cnrisson e. a.., Nature 380:207 (.996), al, 
of which are incorporated hy refers). Other analog nucleic acids include those with 

backbone, (US. PatentNos. 5,386,023, 5,637,684, 5,602,240, 5,2,6,14. and 4,469,863; 
Kiedrowshi e. al., Augew. Chem. Intt. Ed. Engliah 30:423 (1991); Lotstager et al., J. Am. 
Chem Soe. 1 10:4470 (.988); Lelsinger « al. Nucleoside & Nocleoude 13:1597 (1994); 
Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antiaense 
Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker el al., Bioorgamc & Medtonal 
Chem Lett. 4:395 (1994); lefts et al., J. Biomolecnlar NMR 34:17 (1994); Tetrahedron Lett. 
37-743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 
5 235 033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate 
Modifications in Antiaense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleio actds 
containing one or more carbooyclic sugars are also included within one definition of nuclerc 
acids (see Jenkins etal., Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs 
are described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby 
expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nuclerc acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics PNAs have larger changes in the melting temperature 0U for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 
30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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by Jin the a* ft. ft**, of a — - <— » "•"» f 
.heLrence The nucleic acid may be DN A, both genomic and cDNA, KNA or a nyD , 

Lhin. hypoxanthine, isocytosine, isoguauine, * "B-W » » 

• LJw incudes nuOeotides and nucleoside and nucleotide analogs, and moaned 

■ , Thus e a the individual units of a peptide nucleic 

naturally occurring analog structures. Thus, e.g. me inn 

acid, each containing a base, are referred to herein aa a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by 
spectioscopic, photochemical, biochemical immunochemical, chemica,, or other physical 
Ins Forexamp.e.usenal.ab.lainctade^fluoresceotdyes.clechon-dcnsereagen.s 

pcpddeorusedlodeuctantibodicsspecmcauyreacnvewimmepephde. The label* may be 
Uporated into tire breast canoer nnoieio acid, protema and antibodies a. an, 

i ncLdingmosemeurodade K nbedbyHun te e.a 1 .,Hg B!ffi lM : 945 OW"-*** 
: ni.014 (1974); Pain e, al, U-AM. **» < 1981 > mdN ^ 

Hitehsm^dca^ 

An "effector" or "effector moiet/ or eitector compuu 
^oundCor tinted, or conju^eiu^— y,«hreu*.»n 1 c«ornchen d « 1 bond,or 

The "effector" can be a variety of molecules including, e.g„ detection moreues includnig 
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togs, , toxin; acuvatable moicdes, a chemodmrapeuhc agent; a Upase, aa anttb 

^oiaotopeennntog-hard^-S-.^^*"- H ^, is0M abound, either 

A "labeled nucleic acid probe or oligonucleotide is one that is , 

nartners binds to the other, e.g., biotin, strepttvidm. 

Aansedhereina^ucleicacidproheoroHgonuclennde-iadennedaaa 

n „cleic acidcapableofbindhrg.oarargetnue.eioacidofcoinp^.aryae^uence^ 
one or more types of chemica, bonds, usuaUy dnough commentary base pamng, usuaUy 
through hydrogen bond formadon. Aa used herein, a probe may include nahiral (,.., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addirion, me bases in a probe 
may be joined by a linkage other than aphosphodiester bond, so long aa tt does not 

„Mch me iionsumentbases axejotoedby ^ 

It wffl be understood 'by one of skill in the art to, probes may bind urge, sequences lacking 
complete complement with to probe sequence depending upon the stringency of the 
hybridization conditions. THeprobes are preferably directly labeled as with isotopes, 

strepl avidin complex may later bind. By assaying for die presence or absence of the probe, 
one can detectthe presence or absence of die sete. sequence or subsequence. Diagnoses or 

The term "recombinant" when used with reference, eg., to a cell, or nucleic 
asid, protein, or vector, indicates that the C.U, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alterabon of a 
native nucleic acid or protein, or that the ceU is derived from a eel. so modified. Thus, e g., 
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„, no, expressed at al.. By,he,erm fnucleic add, e.g.. usmg 

origm.Uyformed.v.viw.mgeneral.byO.eman.p to Ms manner, 

operablyUrjkage of diir^ent sequences is achieved-^an that are not 

normally joined, are both centered reconrbman me ^ or 

acid indicates tba, the nnoloic acid emprises two or more subacuencea Urn. am no, normally 
found in mo same mlauonship ,0 each other in nabore. For insburce, the nuc.erc ac,d . 
^icaUyrecombnum.lyprodnoed.navmg^o or more fences, e.g., fiom unrelated gene, 
an a»ged to n^eanew f unononmnt tt leioaoid,e.g.,apmmo,er 6 omones^ce^da 

codi region fiom another source. Singly, . —us pro,ein will ofien refer* 
, ormorosub S e,uence S ^are»o,fo»ndm,be S amereladon S nip,oeacho,h=r m nam re (e.g, 

a fusion protein). 

A "promomr" is defined aa an army of nucleic acid cental sequences mat 
aimc^oripfionofanudeioaoid. Aan^hemm.apromoUrmolndesneoessaaynuc,™ 
acidsequenoeanearfirosmnsimofn^fion.snohas.inmecaaoofapolymera^nwe 

' ^.whichoanbeloca.edaamncbasseveralnronsaodbaaepaimfiomdaes.ar.^of 
.mnscriptton.A-oona.ibrfive-pmmo.eria.promotermatisac.ivemrdermoa, . 

Ivconderenvironmenmiordeve.opmenulregu.afion. The term "operably Indeed refers 
30 „ a fimottonal linkage between a nucleic acid expression control sequence (such aa a 
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wherein the expression control sequeoo 

co^ndingtofceseoond sconce, ^rf^**" 

transcribed operably linked to a promoter. & 
The phrase "selectively (or specifically) hybndizes to 

lie L, hu, to no orher fences. Stringent conditions are J - 

higher temples. An extensive .ride to «he fctddta*. of nne.eic acrds « foun - 
Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization W ^^^^ t 
Les -Overview „f prhrcipies of h^ndizahon and One sheregy of nndere ao^ays 

l te npere«nre(unde, defined ionie strong*. pH, and nucleic concentraUon) a, which 50/. 

U.etnrge.se.nenonsarepresen.ine.oess.a.T.SOy.offireprohesarooocuproda 
, 5 " ihbrinnr) St nn g en.oondi tt «swinhe ttl o S eh 1 wMoh«»sa,«eon«n^on lS l^ 

Glides) and a. .east ahont tfC for »ong prohes (e.g., greater than 50 
3„ ^nlride. Forsel^veorspecmchyh^on.aposifivesignaiisat.eas.hvohrnes 
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hybridization conditions can be as followuig jx ^ 

• . « w ™ sv SSC 1% SDS, tacubahng at 65 O, wnn »» 
incubating at 42 C, or, 5xSSC,l/.t> , w foI low Agency 

0.1% SDS « 65«C. For PCR, a lemperahne of about 36 C . «P 

alndingonprimer long*. For high stringency PCR amplification, «»-*-* 
^ Wid although *» — y anneanng .etnperahnes can range *. *»« 50 C 

30 sec- 2 min, an annealing phase lasting 30 sec. -I rain., an 

KfcW* Pro.oeolsandgmdenneaforlowandmghshmgenoy^pMc.non 

Armlications, Academic Press, Inc. N.Y.). 

Nncleicacid.^dono.hybridizeteeachomeaundetalringenieondthonsare 

™soccnrs,e.g,whenac 0 pyofanneleieacidise I eateousmgmemaxm,nmcodon 

nndarmoderaWyshangenthyhriaizarioncondiUons. Exemplary Moderately stringent 
hybridization con«oi*» to clud.ahybnd^ 

l ./.SDSa.3rc,andaw a ahinlXSSCa.45'C. A positive hybridization is at least twice 
background. tnose of ordinary skill will readily recognize thai alternative hybridation and 
wa! h conditions can he utilized <o provide conditions of similar stringency. Additional 
Adelines for detennining hybridization patameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ansubel, et <d. 

The phrase "nractional effects" in the context of assays for testing compounds 
to, modulate activity of abreast cancer protein includes the detennhtafion of a parameter 
« is indirectly or directly under me influence of the breast cancer protein o, nuclerc acid, 
eg a ta ctional,physica3,oreh.nncaleffee,sncha S fteahili^«odecreasebrea,.ca„cer. I. 

includes Ugand binding activity, cell growth on soft agar, anchorage dependence; contact 
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orowth factor or serum dependence; tumor specific marker levels, inv 

me tastasis, and other characteristics of breast cancer cells. Tuncuonal efl 
vi/rn in vivo, and ex vivo activities. 

^ inspecboscopic MM fr* *— • *— * «*~—«" * 
shape), cinematographic, or solubiutypropemes fetoprotein, 

^urtngburding activity orbinding assays, e. g . binding » antibodies or otherhgands, and 
^clrcaa^boperfo^ed^breaa.oanceraasaystawotothosaofs.aU.a^ 
« such as an * vflro assays, e. g ., cell growth on soft agar, anchorage dependence; contact 

-«* factor or serum dependence; tumor specific marker levels; invasiveness ntto Mahtg.1; 
nnno, growfh and metastasis * vivo; mSNA and protein expression in cells undergoing 

, nte^andotherchaxactensdcsofbresstcattcarccUs. The Actional affects can be 
Seated by many means known to those skilled in me art, eg, mictoscopy for qututtflattve 
« qualitative measnres of alterations in morphological features, measurement of changes « 
RNA or protein levels for breast cancer-associated sequences, measurement of RN A stability, 
iCendficution of downsbeam or reporter gene expression (CAT, luciferase, f-gal, GFP and 

5 die like), e.g„ via ohemihnninescence, fluorescence, colorimenic reactions, antibody bmdmg, 
inducible markers, and Ugand binding assays. 

md po.ypep.ide sequences are used to refer to activating, inhibitory, or modulating molecules 
« compounds identified using i„ vitro and in vivo assays of breast cancer potynnolentide and 
30 polypeptide sequences. Inhibitors** compounds fl^^bmdm, partially orfotalfy block 
- ' '■ 24 



r7US02/02242 

WO 02/059377 



10 



20 



25 



activity or expression of breast caacerprofoms, ,.g., „ 

well as natura y ^ mclude) e&> 

chemical molecules and the hke. Such assays 

ofl or m ore b rea S .cancerpro^,e.g.,i,2,3 > 4,S, I 0, 15 , 20,25.30, 40, 50 rrn^s. 

Samples o, assays comprising breast eaneer proteins that are treateti with a 

(mtrMttd « inhibitors) are assigned a reiative protein activny value o 100/.. fiction 
ofapoiypeptideisaoHev^wbenaeaetiviryvalnereUtive^tooonti.lrsabon.SO/., 

p«5 W ,mo reP r.f^.y25^Activ»tion.fab^^po^« 

m „re prefer*!, .50%, more preferabiy 200-500% (i.e., two «o five fold hrgher reiative to fire 

^.^^ 1 „~^„ retetoany ^ etoce n g ro W « 1 ana 
The phrase "changes m cell growth reters xo any w & 

proration chamcteristics i» *. or In vivo, sneh as formation of foci, anchorage 

station of growth, loss of growth footer or sennn requirements, changes ur ecu 
^rphology.gamhrgorlosmgmunorUU^^ 

ability to form or suppress tumors when injected into suitable animal hosts, and/or 

IS 
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"Cancer ceta" Wormed" ceUs or Wormanoo" to tissue culture refer, 

new genetic material. Although transformation can anse from mfection wit* * 

ILmcomorationofnewgenonuc 

spontaneously or foUowmg exposure to a 

Transformation is associated with phenotypic changes, such as immortal^on of cells, 
0 abe^antgrowmcontro^^ 

Culture of Animal Cells a Manual of Basic Technique (3 rd ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework regum from an 
inununoglobulin gene or fragments thereof that specifically binds and recognizes an anUgen. 
. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
15 epsilon,andmuconstantW 

genes Light chains are classified as either kappa or lambda. Heavy chains are classmedas 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM IgA IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 

20 Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one«Ught»(about 25 kD) and one 'W chain (about 50-70 Id)). The N-terminus 
of each chain defines a variable region of about 100 to 110 or more amino acids pnmanly 
25 responsibleforantigenrecognition. The terms variable light chain (V L ) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a numberof well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsm 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2> a 
30 terofFabwltichitselfisaUghtchamjomedtoVH-CHlbyadisuffideb^ TheF(ab)' 2 
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, ,o break the disulfide linkage in the hinge region, 
^.er^ueedunderrruidenndruonstohre^ 

essentially Fab with part of the binge region ^ ^ digestion of an intact 

^body, one of aWtt will appreciaK «ha, such ftagments may yn 

berein. alao includes anfibody fiagmena eifiier single 
^bndiea.ormoaesynlhesized^.ovousiugreaombnranDNAm 

C.inFvJorfiioaeideufifiednsingphagedispl.ylibranes^. ,g.. MoCafferty 

Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). »e 
^oton^m.ybeu^to^humani.ed^^AB^e!,,^ 

specifically bind .o selected antigens (see, e. g ., McCafferiy « al, Name 348.552-554 
(1990); Marks el al. Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 

antibody, a.g., an enzyme, toxin, hormone, grow* factor, drug, etc, or (b) me .enable 
different or altered antigen specificity. 



30 

27 



PCT/US02/02242 
WO 02/059377 



HLber of genes MmMT -ows the generation of a gene expressron profi. that . 
n^of.es.a.of.tieee, That is, normal tissue (e.g., normal . 

Zl-tiasneormeta^ticbreaat — tiasnecanbecomparetiwtmnasne 

pro L of tisane in known different breast eaneer sfctes, information regardmg whtch genes 

obtained. , . u . 

The identification of sequences mat are differentially expressed m breast 

For example, a particular treatment regime may he evaluated: does a chemo.herapeu.,0 drug 
act to down-reguUte breast cancer, and thus tumor growth or reomtence, in a particular 
patient Similarly, diagnosis and treamren. outcomes may be done or confirmed by 
comparingpatien. sampleswim me known expression profiles. Metastatic tissue can also be 
^y^^deurminemestageofbreastoancerintiteussuo. Furthermore, these gene 
expression profiles (or individual genes) allow screenmg of dmg candidates wtth an eye to 
poking or altering a particular expression profile; e.g., screening can be done for drugs. 
M suppress me breast cancer expression profile. This may be done by making btoohps 
con^ae^of.heimp^breas.c^erg^ea.wWchc^menbe^^m^scr^. 

25 These methods can also be done on me protein basis; that is, protein expression levels of the 
br east cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
„ addition, the breast cancer nudeic acid sequences can he administer* for gene therapy 
p^tacludrnglhe administration of antisense nucleic acids, or me bre*s, cancer protems 
(including antibodies and other modulators thereof) administered as therapeutic drugs. 
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Vm the present invention provide, nucleic acid and protein sequences M 

^Uneo be,ow, breas, cancer sequences Mud. .hose lha, are « 

a higher level) in breas, cancer, as weU as .hose to, are down-regula.cn (t.e., expressed 

however as will be appreciated by those in .he art, breast cancer sequences fan, other 

JLer sequences are provided, torn vertebrates, including oannnaU, uttludtng rodent (rats, 

Lea, etc.) andp^cg., (dogs, caK, etc.). Breast cancer sequences fan, o^organtsn. 
may be obtained using the techniques outlined below. 

Breast cancer sequences can include both nucleic acid and ammo acid 
sequences. As will be appreciated by those in the art and is more fully outlined below, breast 

applications, which will detect naturally occurring nucleic acids, as well as screerung 
applications; e.g., biochips comprising nucleic acid probes or PGR microtiter plates with 
selected probes to the breast cancer sequences can be generated. 

A breast cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determine* as outlined below, using either homology programs or hybridization 
conditions. 

For identifying breast cancer-associated sequences, the breast cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
tissues, or tumor tissue samples frompatients who have metastatic disease vs. non metastatic 
tissue Other suitable tissue comparisons include comparing breast cancer samples with 
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers, 
ovarian etc. Samplesof different stages of breast cancer, e.g., survivor tissue, drug resistant 
states and tissue undergoing metastasis, are applied to biochips comprising nucleic acid 
probes The samples are first microdissected, if applicable, and treated as is known in the art 
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h fmKNA Suitablebiochipsareconunerciallyav.lable.e.g.from 
for the preparation of mRNA. bmtao ^ ^ ^ 

Aff^etrix. Gene expression profiles as described here* are g 

analyzed. , rhanees in expression as between 

In one embodiment, the genes showmg changes m xp 

Oat the target be disease specific, to nunimia possible stde effects. 

number are hereby express* incorporated by reference. GenBankislmotvninute art, see, 
e ,. Benson, DA, et A, Nucleic Acids Research 26:1-7 (1998) and 
, ht ^,„„.»cbi.nhn.niH. E „»/. Sequences are also availab.o in enter 

European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
US Patent Application* 09/687,576, with the sr«ne assignee aa dr. presort apphotuton 

breast cancer is hereby expressly incorporated by reference. 

5 In another preferred embodiment, breast cancer sequences are those that are 

^g^^h^o^ti^Huae^nofu^gen^U^inbreaa, 

cucer tissue as compared to non-carcerous tissue (see. e. g .. Tables 1,2, 3, 15, 
^. re g rf ation- K uaedhe reta oft»n^a,leas«*o«.a W o- fo ldo lM g..pre^lya. 

^ about a three fold change, with a. leaa. about five-fold or higher being preferred. 
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example subcdlular toxicotogtcal mfotmattou can be generated to bener torect drug s*u 
L0 ^F^.paparprasaa.ada.^BCP^nucsconfaranaa.Coronado.CMIunall 12, 

devio! to predic. toe My «M«W — md " 

15 lipids, drugs, and the like). 

Thus in another embodiment, the prasant invention provides a database that 

Modes a. leas, one set of assay data. The data confined in the database is acquired, e.g 
.sing array analysis eitoersinglyortnabVaryfcnnat The da.ab.se can be in subtfantiaUy 
my form in which da* ean be maintained and transmitted, bo. is preferabiy an el«tonte 
20 database. The electronic database of .he invention can be maintained on any etatonto 
device allowing for the storage of and access to toe database, such aa a personal computer, 
b„, is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of me present section on databases that include peptide seqnence 

daB Uforc.a ri «yofiUus.rauouo,dy...wi^ 
25 a^asescanbeasaembledforanyassaydauacquirednstagauassayofthemvenhon 

The compositions and methods for identifying and/or quantitating toe relaltve 
and/or absolu.= abundance of a varie.y of molecular and macromolecular speciea from a 

sequences deacribed herein, provide an abundance of information, which can be correlated 
30 with patooiogica! conditions, predisposition .0 disease, tog testing, toerapeutic mom,onng, 
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known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a 

seouences to be catalogued and searched according to on. or mote pro.cn functton 
H « Pa.1 5,953,727 discloses a reWooal debase having sequ.ce records 

forohUirringM-l^seqnenceafemaecol.echonofpsrtisJlengU.seqnences. U.S. 

^sln.i.ar^aseqn^d^^in.geneda^hasedonnrede^of— 
b? W .en a key sequence and . Urge, sequence. U.S. Pa.en, 5,538,897 disclose amethod 
using mass spectroscopy fragmentation patterns of peptides to identify ammo acd . sequence* 
in computer da.abases by comparison of predicted mass spectra wittt .xperimentally-denved 
«. spectra using ac.oseness-of.fi. measure. U.S. Patau. 5,926,818 discloses a rnultt- 
dimensional database comprising a functionality for multt-dimensiona! data analysrs 
oescAed as ou-line analytical processing (OLAP), which entails me consofidatton of 
project and actual data according to more ftan one consoUdation path or ditnenston. U.S. 
PaKn. 5,295,261 rcports a hybrid debase structure in which ttte fields of each database 
^ord axe divide into N. classes, navigational and mfornrational da* with navrgaUonal 
fluids stored in a hierarchical topological map which can be viewed as a free sttuottne or as 
the merger of two or more such tree structures. 

See also Mount* ai, Bioinformatics {2001); Biological Sequence Analysts: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin * «/., eds., 1999); 
Bioinforntatics^PracticalGuidetotheAnalysiso . , 

Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biologtcal 
Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
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The present invention provides a computer database P , 

s^isfiomaoon.rol^samp.e^.obe^ofpa^ogical—.lna 
variation, a. k~ one of to sources ia a known penological tissue specunen, e.g., 
neoplastic lesion or anotiter tissue specunen to be aoa.yxed for breas, cancer. In another 

.arge, species in a samp.e: (1) a unique identification code, which can include, eg., a Urge, 
molecular structure and/or characteristic separation coordinate (e.g., elecfiophoreuc 

present in the sample. 

The invention also provides fo, the storage and retrieval of a collection of 

targe, data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memo,, devices, and other data storage devices, including CPU reg^e. 
aadon-CPUdatastoragearrays. Typically, tite Urge, dato recorda are stored as a bt. pariem 
taa „an-sy of magnetic domains onamagnefabtemediurn or as an^ruy of charge states or 
^istor gatosUtos.sueh.anarrayofceUamaDRAMdeviceCe.g.eaeheeUoompnaed of 

a transistor and a charge storage area, which may be on the transistor). In one embodtment, 
to invention provides such storage devices, and computer systems built therewith, 
comprising a bit pa«em encoding a protoin expression fingerprint record comprising untune 
identifiers for at leas. 10 target data records crosa-tabulaled with huge, aource. 
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^oftereuaveamountofapepddeorn^cacid^uenccmapooufscu^ 

determine* fiom . polypeptide or nucleic aeid sample of a specnren. 

The invention also preferably provides a magnede drsk, snob as an IBM- 

rl^uaOS, Soiaris, MX, SCO Uni,, VMS, MV, Macmtosb, *.) «oppy drstette or 

tbe invention in a ffle forma, smtabie for relieve, and processing in a eompnlensen sequence 
analysis, comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing 
devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone hne, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medrum, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay resmts obtained by the method of the inventton. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degreeofidentityandgapweighttothetargetdau, A central processor is preferably 
, initialized to load and execute the computer program for alignment and/or comparison of the 
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----- — — — ^ 

Execution of the computer program results m the <*n*d p 
^.hodaume.wMchcomprisesabina.yd^pUonofanassayresuU. 

The «rge. data or record and .he compel program can be .renamed 

u ■ ™,i,,ih random access memory (e.g., DRAM, SKAM, 
5 secondary memory, which is typically random acc .^ondence 

SCRAM, or SDRAM). Targe, are ranted accordmg to md ^ 

h^aae^^y^sao^hmoingm.sdec.^^ W 

_ . w„f .he ouerv target and results are output via an I/O device, ror v 
samecharactmsucoftheouerytarge 

a central processor can be a conventional computer (e.g, Intel . 
,„ PA8000 SPARC,MIPS4400,Mn>S 10000,VAX,etc.) i apmgramcanbcaconmierc. 

vtTe'g ^.SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 

15 r^^^anBme^r^.P^^^^"'"^' 01 
->-^^ on ^^ proridK ._ of aoompu,er^s»has 

^lonofp^^ueno.^mcotd.ob^bythem^odao^m^ 

on the basis of computed similarity values. 

Characteristics of breast cancer-associated proteins 

„ Breaatoanc^prnteinaofmepreaentinventionmaybectoiaedaaaecreud 

pro^.banamembran.pm.em.otm—pro.ema. Inone — 
Lc^pro.einisanmuace^pm.em. Inhacellular proteins may be found m die 

tin and rephcadon (Including, e.g, slgnalhig pathways,; aberrant expression o « 
3„ ^oftenmauhamunmgmatedor W^ular processes (see. 
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*a*a iWtt For example, many intraceUular proteins 

have enzymatic activity P _ Intracellular protems 

^tivitv nucleotide cyclase activity, polymerase activity an 

addition to to highly conserved sequences found in to enzymatic P 
in.ecac.ion. For example, Scc-homo.o^ (SH2) domabs bind rycc^e-phospho^ a,.d 
togets in a sequent decent manner. PTE domains, which are distinct ftom 
domains, also bind tyrosine pfc.sphoryh.ted targe*. SH3 domains bind to prohne-nch 
.argeht. in addition, PH domains, tetratiicopeptide repeal and WD domains to name only a 
few have been shown ,o mediate pro.ein-pro.ein Weraonons. Some oftose may also be 
toolvedut binding to phosphobpids or otorseoond messengers. As will be approbated by 
one of ordinary skill in to art, these motifs can be identified on the basis of prunary 
sequence; thus, an analysis of to sequence of proteins may provide insight into both to 
eazymatie potential of to molecule and/or molecules with whieh to protein may assoctate. 
One useful databaae is Pfam (protein families), whieh is a large eoUeetion of multiple 
sequence atignmentsandhiddenMarkov models covering many common protemdomams. 
Versions are available via to internet ftom Washington University ML Louis, to Sanger 
Center in England, and the Karolinska Institute in Sweden (see, *g„ Bateman « aL, Nuc. 
, cl . Ato . M :263-26«(2000) ; Sonnhammere,u,., <^»™< 
a,., Nuc. Acids Ses. 27:260-262 (1999); and Sonnhammer e< at., Nuc. Acids Res. 26:320-322- 



25 

(1998)). 



In another embodiment, to breast cancer sequences are transmembrane 
plains. Transmembrane proteins are modules to. span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracelhtlar domain, or both. The nuracellular 
domahts of sttch proteins may have a number of function, inctadm, tose already desortbed 
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, , • . Forexampie the in.acellu.ar domain may have enzymatic activity 
for intracellular protents. For example, m F ™ oen tl y me intracellular 

^^Inemhranepmteh.mayeontah.tyomone.many— mne 

^lcye^^ r — ^^ria.eny, 
^ —r^leaomah.. Many ^ - — rs 
dontatn-pro.e^aameycen.ainTmemb^espanmngregtons. Charactenshcs of 
mayhefol.owedbyoh^^^o^.Wore.np.n^oftoanunoa^ 

linmeprotehtmaybe predicted^, e. g . PSORT web site ********* 

tmnamembrane protein receptors include, but are no, Umited to the tnsuhn 
Raptor, inauhn-ult. gmwth factor receptor, hnman gmwth hormone receptor, ghrcose 

JLr.^gmwm^r^l.Pnn^.in^^^eg.IL.. 
„ M re^*^ ^ ^ ^ protebs „ ^ „ 
conservedmotifs amfound repeat^ anmngvanonsexn^llulardomains^ Conserved 

ZenmardomamsaremvoWedmbmdm.tootnermo.ectdes. In one aspect, e— 
Iloaarefoondonteceptom. Pao.orsoaatbmdmerecep.tdon^mc.ude^ 

F L example growth factors snch as EOF, FGF and PDQF am circuiatntg growth factors m 
30 ^'Igna.reoepmm.im^avaHe^ofce^reaponses. Other factors tnchtde 
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^onhic factors and the like. Extracellular domains also 
bindtoceU-associatedmolecul^ ***** J glycosy lphosphatidylinositol (GPI) 

— tr— ------ 

t^v are readily accessible targets for immunotherapeutics, as are 
present invention as they are readily acces 

tvpicallypermeablized to provide access to intraocular protons. 

Fortoermore, transmembrane proteins to., have been made so.nb.e can be made to be 
15 se^tedthroughrocombinanrmernabyadd^ ' 
m another embodiment, the breast cancer proteins are secreted protetns; toe 
secretion of which can bo either constitutive or regulated. These proteins have a stgnal 
peptideorsignalsequenCha.targetstoemoleouleto.heseore.orypatoway. Secreted 

oncelUincloseproxinn^.otoocoU.ha.seore.od toe factor) or an endoenne manner (acting 
aspects of physiology. Breast cancer proteins toa, are secreted proteins are parucvuady 
for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by 

^stanudnucldcacid^ 
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*« ho based uoon the overall nucleic acid or 
secures outitod herein. Such homology can be has* usi „ geitlKrhOTolo8y 

^gmms or hybridisation conditions. Typically, linked sequel nrRNA ar 

ta Tab.es ,-25, ean be *agmen«s of Ur«er genes, U, toy are nueU* - 

■W in to con,ex, includes ending regions, non^ding regrons, and nurtures of c«tog 

seouences provided herein, extended sequences, in either dnechoa, of the breast cancer gen 

Once the breast cancer nucleic acid is identified, it can be cloned and, rf 
neceaaary, tta constihren, parts recombined to fonn to entire breas, cancer nuclerc aerd 
coding regions or to entire mRNA sequence. Once isolated from h natural source, e g, 
c 0 „ W nedwitoap.asntidorotovec«ororexcisedtore to masalin.arnuc.«.cao,d 

and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be 
used as a "precursor" nucleic acid <o make modified or variant breast cancer nuclerc acrds 

and proteins. . , 

The breast cancer nucleic aeida of to present invention are used m several 

„ays in a firs, embodiment, nucleic acid probes to the breast cancer nucleic acids are made 

wi attached to bioehrps to be us.! in screening and diagnostic methods, as outlined below, 

« for administration, eg., for gene therapy, vaccine, and/or antisense applications. 

AUematively, to breas, cancer nucleic acids tot include coding regions of breast cancer 

pI „,eins can be pu, into expression vectors for to expression of breas, cancer protems, agam 

forscrecmngpinposesorforadministrationtoapatienl. 

In a preferred embodiment nucleic acid probes to breaa, cancer nncle,e amds 

, (torn to nneleic acid sequorces outlined in to figures and/or <he complement thereof) are 
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Mde . tt e» U c. d cao i ap^a^.o«hobiocMpa I =de Slgn e d «obe— y 
compfementory .o -he breast cancer nucleic adds, «. ***** <«~*' 

kybridizauon of the Urge, sequence and «he probes of the pmsen, invemron occus. * 

Igle snanded nucleic acids of toe present M. However, if toe number of muUhons 
i. so grea, «ha, no hybridization cm occur under even .he leas, stringent of hybridan 
conditions, ft. sequence is no, a compjemenuuy urge, sequence. Thus, by "subaanbally 
complementory" hereto is mean, that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high atnngency 

conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
compo S iti 0 n,andpro P ertiesofthetargetsequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
either overlapping probes or probes to different sections of the target being used. That is, 
two three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PCRprimers maybe used to amplify signal for higher sensitvtty. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent By"non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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amoreculo, -.as, ^«-a,a grammatical 

sappor.orth.probcorbothmolec^.tamobfflz^nmay^mv.Weacom 

synthesized firs,, with subsequent attachment to the brochrp, or can be drrectly syn 
^ ^biochipcotnpttsesaant.bteaoUdanbsttate. 

JLdiscreteindMdual sites appropriate forU-e attachment or a^atton of tttennde, 

^^tt^t 

« of Poa*.. s^^very^.andMuae.bu^notU^.ed.o, 
^ and modified or functionalized gUss, plastic* (ineiuding acrylics, polystyrene and 
Lo^ofstyrenearrfomermateria!*,^^ 

!£LjLl.~l potysaeeharides.nytonormttoeeuutoae, reams, stUc, or sdtca- 
olrfcs etc In general, the substrates allow oprical detection and do not apprectably 

, rien.pLoBioohip.U.S. Apphcanon Seriai No. 09«70, 21 4,«edMamh .5, »». 

herein incorporated by reference in its entirety. < 

CHneraUyuteaubshateiaplanar.a.thoughaawiUbeapp.ciatedbymosem 

U.e^omerconfi^rionsotsuhsttatesn.ybeusedaawen.Forexan.ple^epn.besmay 
Teptaced on the inside surface of a tube, for flow-through sample anaiysU to rnmnmze 
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closed cell foams made of particular plastics. 

h, a preferred err—, toe surface of toe biochip and toe probe may be 
^vauzed wito chemica. tonctiona. groups for subsequent a—, of toe two •Thus, e.g., 

^oups, carboxy grnups, otto groups and toiol grnups, irife amino groups beu,g 
prfcularlypreferred. Ustag toese functional groops, toe probes oan be atiachedustng 

attached to surfaces comprising amino groups, e.g. nsing linkent as are known tn toe art, e.g., 

catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkecs, such as alky! groups (including substituted and heteroalkyl groups) may be 

used. . - * 

In this embodiment, oligonucleotides are synthesized as is known in the art, 

and then attached to the surface of the solid support. As will be appreciated by those stalled 

in the art, either the 5' or 3' terrninus may be attached to the solid support, or attachment may 

be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oUgonucleotides can be made, which 
bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oUgonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those descnbed 
; m WO 95/251 16; WO 95/35505; U.S. Patent No, 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of breast cancer-associated sequences. These assays are typically performed in 
conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid 
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quantitative PCR arc provided, e.g„ in fonts et «/., PCS frotoc 

,0 dyeanda3'q»=ncbta 8 a g entTh e probehybntotoaPCRproduct,bu t o 

eLdedduefoabloc.tingag«nta,ra«r.nd. Wb^d-PCRp^^p^". 

OeavageoftiteTaqManprobe. Tbi, cleavage separate, dr. 5' fluorescent dye and ft. 3 
quenching agent, thereby renting in an increase in fluorescence as a function of 

q s 6 ., jt,,,^. pime, en www2.perirawlmer.com). 

15 a mplmc^on( S cc,e.g.,Ute«tareprovidedbyPcrlcin-Eliner,e.g,www p 

Other suitable amplification methods include, bnt are not tented to, hgase 
chain reaction (l£R) (see Wu & Wallace, Genomics 4:560 (1989), Landegrenetol-, Science 
241-1077 (1988), and Barringer « at., Gene 89:117 (1990)), transcription amplication 

20 GmB m«n ) ..P^«^^^^ 874 ( W0 »^ ,P ^^ toad * ,eI ^ 



etc. 



Eipresslon of breast cancer proteins from nucleic acids 

I» a preferred embodiment, breast cancer nucleic acids, e.g„ encoding breast 
25 ^oerpmtein.raoused^n^eavarietyof^saiouv^mtoexpressbreaa.cauoer 

p^wbichoanmenbons^mac.enhrgasaays.asdesoribedbelow. Bxp^ton vectors 

andrecombinantDNA technology are well known, o those of skill inthe art (se.e.g 
A ^^,a^c^^ to ^(Fcn M udez*Hoe« te r,*1999))» 1 da K 

^toexpressproteras. The expression vectors may be either 

chromosomal vectors or veetots which integnue into a host genome. Generafly, Utese 
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. , „ «*tional and ttanslational regulatory nucleic acid operably 

kuown ,o utilise premo.cs, polyadenylatioo signals, -**->» 

Nucleic acid U "operably linked" wbeu it is placed urto a functional 

^ry^isor^lyliAedreDNAforapolypepbderfrtKexpreaaeaaa p P 

„ operably .inked re a cooing sequence if 1, is positioned so aa re facffi<a«e .nmslatiom 

„d. in d. case of a secretory leader, contiguous and in reading phase. However, enhance 
do no, have to be contiguous. Linking is typically accomplished by ligation at convement 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or tinker are 
used in accordance with conventional practice. Transcriptional and trenalational regularery 
nucleic acid will generally be appropriate * the host cell used to express tire breast cancer 
protein. Numerous types of appropriate expression vectors, and suitable regulatory 
sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may urclude, 
„„, are no, totted re, promoter sequences, ribosomal binding sites, tianscriotional tfart and 
^p sequences, unnslational rtart and s<op sequences, and enhancer or activarer sequences 
U a preferred embodiment tire regulatory sequences include a promoter and uunscnpuonal 

start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. Hie 

pro motersmaybeeimernatura^ Hybrid 
promoters, which combine elements of more than one promoter, are also known m the art, 
and are useful in the present invention. 
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^ued to W o organisms, e.g. in — or tnsect ^ « 

voters arc well known in the art (e.g., Fernandez & Hoefiler, supra). 

^addWon.nraprefeneoembodimen^ee^resatonvectorc.n^nsa 

setectablenwkergeneto allow the selectiono, transformed host com. Selecrion genes are 
well known in the art and will vary with the host cell used. 

in u,e art through routine experimentation or optimization. For example, me use of 
constitutive promoters in to expression veotor will mquire optimizing the growth and 

U conditions for induction. In addition, in some embodiments, me nmmg of me harvest 
Uimportant For example, me haenloviral systems used in inseerceUexpreasron are !y«c 
viruses, and thus harvest time seleotion can be crooial for prodnct yteld. 

Appropriate host cells include yeast, bacteria, archaebactena, fungr, and msec. 
^^ceBa,hml»dingnummmnanc^Ofp»rnc»larmte^ TO &c^^ 

cerevHtae and odter yeasts, E. coU. BaCUus sMIis, SB ceHa, C129 ccUs, 293 «fc 
pteurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbiheal vein endothehal 
cM THP1 cenaCamacrophageoellUneJandvariouaomerhumancellsandceUhnea. 

In a preferred embodiment, the breast cancer proteins are expressed m 
,0 mammalian cells. Mammalian expression ayatemaare also known in the art, and include 
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the nromoters from mammalian viral genes, since the viral genes are otten mgiuy 
5 Idlest. « ^SV^O^yp^.— « 

- include those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transition, calcium phosphate precipitation, 
15 polybrenemediatedtransfection.protoplastmsior, 

encapsulation of the polynucleotide^) in liposomes, and direct microinjection of the DNA 

into nuclei. m . 

In a preferred embodiment, breast cancer proteins are expressed abacterial 

systems. Bacterial expression systems are well known in the art. Promoters from 
20 bacteriophagemayalsobeusedandareknownintheart. In addition, synthetic promoters 
and hybrid promoters are also useful; e.g., the tac promoter is ahybrid of the tip and lac 
promotersequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of r*n-bacteri^ 

initiate transcription. In addition to a functioning promoter sequence, an efficient nbosome 
25 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). Thebactenal 
expression vector may also include a selectable marker gene to allow for the selection of 
30 bacterialstrainsthathavebeentransformed. Suitable selection genes include genes which 
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tt ,eart,suchascalciumcUoridetr«at m cn.,el«=troporation, a ndota S 

I„„ Me mb<>dime n Orea S .can«p ro te ta sareprod»ced 1 nu K «c.c 6 lls 
Express vectors for ft. ^fonnadonof tot c«m, andinpartio^bac^s-based 

expression vectors, are well known in the art. 

hrap.eferredenrbodtaen.breas.o^cerpro.einiaprodnoedntyeastee.la. 

Yeas, expre.sionsya.ema are weIlknow.in.he art, and include expression vectors for 

S^Haromyces cerevisiae, Candida albican* and C. Hansenula polymorph,, 

Kluyveromycesfi-agUis and K. tools. Pichia gwllerimondii and P. paswm, 

Schlzosaccharomycespombe, and Kammao lipoidica. 

The breast cancer protein may also be made as a fnsion protem, asmg 

techniques well known in the art. Thus, e.g., for the areata of monoclonal antibodies, ,f tire 
desWepi.opeUsmati.tirebreaa. cancer protein may he te eri .o a canier protein .0 form an 
tananogen. Alternativety, tiro breaa. cancer protein may he made as a fusion protem to 
tacrcase expression, or for otirer reason, For example, when ftebreaa. cancer protem ts a 
hreaa. cane peptide, the nucleic acid encoding tire peptide may be linked «o outer nucletc 
acid for expression purposes. 

to a preferred embodiment, the breast cancer protein is purtfled or raolated 
after expression. Breaa. cancer ptoteins may be isoiated or purified in a variety of ways 
taotvn to tttose skilled in tire art depending on what otirer components are present tn the 

I. Standard puriScation metirods include electophoreuc, molecular, hnmunologma! 
and chromatographic techniques, inchtding ion exchange, hydrophobic, affinUy, «* »«» 
Phase HPLC chromategraphy, and ctoomalofocusmg. For exampte, tire breaa. cancer proton 
rj he purified using a stendart anti-breast cancer protein antibody column. TJUranUration 

47 



PCT/US02/02242 
WO 02/059377 



10 



geaeral guidance in suitable purification techniques, see * 
LdegTeofpurificauonnecessarywiUvary depending ontheuseof Abreast cancer 

protein In some instances no purification will be necessary, 
protein, in ^ necessa rv the breast cancer proteins and 

Once expressed and purified if necessary, me o 

,. „ ™ wm avheusedasimmunoselecuon 
nucleic acids are useful in a number of applications. They may be used 

reagents, as vaccine reagents, as screening agents, etc. 
Variants of breast cancer proteins 

cancels as -pared ,o towfid-type sequence. That- a, ^™ J^T"' 

15 Alsomcludedwithinoneembodimentofbreastcanceiproteteofthepresent 
invcnnonareaminoacidsequeneevariants. These variants typically fell into one or more of 
to, classes: MM insertions! or deletions! variant. These variants ordmanly are 
prepared by site specific metagenesis ofnucleotides in to DNA encudin, to breast cancer 

20 pmUm.uamgcassetteorPCRmn^eneaUorotorteehmqueswelllmow.mtoart,^ 

cutture as onUlned above. However, variant bteast caneer protein fragments havmg up to 
about 100-150 residues may be prepared by in vino synthesis using established tecbntqueu. 

25 a feature to. sets torn apart from nahunlly occurring alleUe or interspecies vanatton of to 
breast oaneer ptotein amino add sequence. The variants typically exhibit the same 
^Utarive biological activity aa to nahnully occuning analogue, altough variants ean also 
„e selected which have modified chatacteristics aa will he more fully ouflined below. 

While to site or region for introducing an.amino acid sequence vartaoon .s 
30 predetermined, to mutation perse need no, be predetermined. For example, in order to 
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mutagenesis ** mutagenesis. Screening of fite mutants is done us,ng assays of breast 
^^^ 4 ^ om ^ wMyof ^^ :i ^onsusuaU, 

„ffl be on to order of from about 1 to 20 amino aeids, although eonsiderahly larger 
Illmayhe^ed. Pomona nnge from ahou. . «o aho, 20 residnes, ahhongh m 

some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arfve a, a final derivative. Generally these ehangea are done on a few amino adds to 
mbl into the alteration of me motatie. However, larger ehanges may be tolerated in certatn 
circumstances When small alterations in me characteristics of the breast caneer proten. are 
a^ired, substitution, axe generally made in accordance with the amino aeid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and vail 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the breast cancer proteins as needed. Alternatively, 
the variant may be designed such that the biological activity of the breast cancer protem is 
altered. For example, glycosylate sites may be altered or removed. 

Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the charge or hydrophobic^ of the molecule at the target site; or the bulk of the side chain. 
The substitutions which in general are expected to produce the greatest changes m the 
polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl is 
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 
alanyl; (b) acysteine or proline is substituted for (or by) any other residue; (c) aresidue 
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. . . .„ 6 . lysy , argtoyl, or histidyl, is substituted for (or by). 

■ « L type of covalen, modification Mute ..acting targeted annno 
TOp oof«n 8 u,v.anon Onaweofco gCT(tatiscvable 

po.ypephde. ^^ tta ^.^ te ^ portnKtri x 0 r S u*coibr«in aK 

my described below. Commonly used blinking agents include, e.g, 1.1- 
^4-aridosalicylicacid^ 

Other modifications include deamidatton of glutaminyl and asparagmyl 

l y ,a«on of me amino g^ups of the lysme, arghrine, and hisuuine aide ohama (Cretghton, 

tt^amine.andamidattonofanyC-teminalcartoxylgroup. 

Another type of covalent modification of the breast cancer polypeptide 

ofmepolypepdde. "Altering Ute nafive glycosylation pahem" is intended for purposes 

^polypepriae.and/o.addingoneocmorog.yoosy.aUonsi.ea^are^^m^ 
^^l.b^.cnnoerpol^G.^onpa^canbeal^mman, 

ways . ft. example the use of different cell types to express breast cancer-assoaated 
sequences can result in different glycosylate patterns. 
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sequence breast cancer polypeptide (for O linicea giy y 

partly by ^gaeDNAeaoodingmebrea^c^rpoypepUde.p^ 

Motomeansofincr^ingtanun.berofcarbohydra.omo.ePesonte 

10 polypeptide. Suchm e thod S ar=de30rib e dinthea I l,c.g.,mWO87/05330,an 

Wriston CRCCrU.Rev.Biochem., pp. 259-306 (1981). 

^ova.of.arbcbydr.^o^esp^onU.ebrea.lcancerpolypepUde 

Myb cac C o n p«sb^cb= mi ca 11 yor«^Ucanyorbym— sub^donocodo. 
en codta 6 for»inoao i dre S idu«lha t s CT v.as t arg« S forglycosyWo I , Chemrcal 

, 5 deg,ycosylation — are m me a« and *— . * ^ ^ 

« at, ^. Biophys., 259:52 (1987) and by Edge a, .(,« «•«> 

(.981) Enzyntetic olaavag. of cnrbohydrat. moieties on polypeptides ean be acteeved by the 
use of a variety of endo-and exo-giycosidasea aa deaeribed by Thotelcnra a, al. Mart. 

Enzymol., 138:350(1987). 
20 Another type of oovalent modification of breaat caneer compnsea hnkmg the 

breaateancerpolyneptidetooneofavarietyofnonproteinaeeonspolyrnera.e.g., 

polvemyieneglycohpoiypropylene^^^^^ 

U S. Paten, Nos. 4,640,835-, 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Breaat caneer polypeptides of the present invention may also be modrfred rn a 
25 way to form chimerie moleonles comprising a brass, eaneer polypeptide fused to another, 
he ^.ogou S po 1 ^deor^aoidse <1 uenc,. to on.embod ta en,^.^c 

^ecnleoomprises.msionofabreaat cancer polypeptide wim a teg polypeptide whrch 
provides an epitope to which an anti-teg antibody can se.eotiv.ly bind. The epitope teg rs 
generaUy placed at me aminos c^xyl-terminns of me breaat cancer polypeptide. The 
30 presenceofancbepimpe-teggedfonnaofabr^tcancerpolypaptidecanbedetactedusmg 
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an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 

5 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 

10 al, Mol Cell Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al, Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al, 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al, BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al, 

15 Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al, J, Biol Chem. 

266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al, 
Proc. Natl Acad. Set USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins from other organisms, which are cloned and expressed as outlined 

20 below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins from humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

25 about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to 
30 generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein 
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particularly linear epitopes. In a preferred embodiment, the epitope is unique; that rs, 
antibodies generated to a unique epitope show little or no cross-reactmty. 

Methods of preparingpolyclonal antibodies are known to the skilled artisan 
(e.g.,CoUgan,^m;andHarlow & Lane,^a). Polyclonal antibodies ean be raised ma 
mammal, e.g.,by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
subcutaneous or intraperitoneal injections. The immunizing agent may include a protem 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the m^umzing agent to a protein known to be immunogemc m 
the mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 

without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 
immunizing agent will typically include apolypeptide encoded by a nucleic acid of Tables 1- 
25orfragmentthereof,orafusionproteinthereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
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L 59 ,03 (1986)). Inunortalized cell ita. »— BT«— -* — — *• 

toe^furttanortalizedcen, For* if parental ceUs lack tt-een^e 
the uniusea, uum mrPPT or HPRT) the culture medium 

hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), ft 

medium^wWchsubstaacesp^^ 

In one embodiment, the antibodies are bispeeific antibodies. Bxspectfic 

todies aremonoclonal, preferably human or humamzed, antibodies that have bindmg 
specificities for at least two different antigens or that have bmding specifiers for two 
epitopes on*e same antigen. In one embodiment, one of the binding speofiahes , or a 

other antigen, and preferably for a cell-surface protein or receptor or receptor subumt, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 

multivalent reagents. 

fa a preferred embodiment, the antibodies to breast cancer protein are capable 

of reducing or eliminating a biological function of a breast cancer protein, as is described 
below That is, the addition of anti-breast cancer protein antibodies (either polyclonal or 
preferably monoclonal) to breast ^^vc***^^^™**** 
or ehminate the breast cancer. Generally, at least a 25% decrease in activity, growth, S1 ze or 
the like is preferred, with at least about 50% being particularly preferred and about a 95- 
1 00% decrease being especially preferred. 

to a preferred embodiment the antibodies to the breast cancer protems are 

humanized antibodies (e.g., Xenerex B to 

DesignLabsJnc.) Humanized forms of non-human (e.g., murine) antibodies are clumenc 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab Fab' F(ab')2 or other antigen-binding subsequences of antibodies) whichcontain 
30 minimalsequen^ Humanized antibodies include 
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human specie, (donor antibody) such as mouse, rat or rabbit having 

■„ r mm ei„stances Fv framework residues of the human 
affinity and capacity. In some instances, HumaniK d antibodies 

^oglobulin are replaced by corresponding non-humau residue,. Human** 

I^Rer— kse,ueocea. fn general, a humanised antibody w, oompnse 
^.antia.lyaUofa.l^one.andWicmlymo.variabledomains.mwmchaUor 

substantially an of me CDR regions correspond «o those of a non-human immunoglobulin 
aadah or substantially all of me framework (FR) regions are those of .human 

„ .east . portion of an immunoglobulin consent region (Fo), typicaUy that of a human 
tenunoglobutin (Jones e, al., N^e 321:522-525 (1986); Riechmann e, al.NaHre 
332-323-329 (1988); andPresta, Can. Op. SHU*. Biol. 2:593-596 (1992)). Hum— 
can'bc esaentiaUy performed following the method of Winter andco-workers (Jones et a,.. 
N^e 321:522-525 (1986); Riechmann e, al.NaHre 332:323-327 (1988); Verhoeyen m *. 
Science 239:1534-1536 (1988)), by substituting roden, CDRa or CDR sequences for the 
ccresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Pa.en.No. 4,816,567), wherein substantially .ess man an mfact 
human variable domain has been substituted by the corresponding sequence from a non- 

human species. . u 

Human antibodies can also be produced using various techniques known m the 

art, including phage display libraries (Hoogenboom & Winter, J. Mol Biol. 227:38! (1991); 

are also available for the preparation of human monoclonal antibodies (Cole et al, 
MonoclonalAnHbodies^CancerTheraw^ 

147(l)-86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
i^unoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
i^unoglobulmgeneshave^^ Upon challenge, 

) human antibody production is observed, which closely resembles that seen in humans in all 
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5,661,0.6, and in to Mowing — publications. Ma* eraf ^ 
ll.Uh.rtJU-WIWOWi Morrrso»,»»re 368.812 
783 (1992), Lonoerge Neuberger, Maura 

S <1994);Fishwild a, 0 ;., »*ra Bi<-<ec»»o(ag, 14.845-51 (1996). N 6 

M«M» 14:826 (1996)-, Lonberg & Huszar, Men,. Re,. Inmm*. 13.65-93 (1995). 
MM J^^.^^of^, cancer wito an ^body^ad 

0 (patient) A^h»^i.<l-»^-^^ T ^ , ^™" m , 

with an antigen .o which antibodies a. raised. As appreciated by one of ordnrary *D .„ .be 
m. to antigen may be provided by injecting a polypeptide against which antibod.es are 
desired to be raised into a recipient, or contacting to recipient with a nucleic ac,d capable 
15 expressing to antigen and nnder conditions for expression of to antigen, leadtng to an 

immune response. . 

In a preferred embodiment to breast cancer proteins against which anttbod.es 

are raised are secreted proteins as described above. Without being bound by theory, 
antibodiea used for treahnent, bind and prevent to secreted protein from bunting «o « 
20 receptor, thereby inactivating to secreted breast cancer protem. 

In another preferred embodiment, to breast cancer protem to winch 

25 aoabodymay^edowr.-regnlationofthettanamembranebrenatcanoerprotem. Aswtilbe 
apprectatedbyoneof ordinary sW lI in to art, to antibody may be a competitive, non- 

toast cancer protein The antibody is also an antagonist of to breast cancer protem. 
p^ctoantihodyprevenuaotivationoftobansmembrane breast cancer protem. In one 
30 aspect, when to antibody prevents to binding of other molecules to to breaa, cancer 
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„ n rr«* fln rihodv may also be used to target or 
•« . ft. call to cytotoxic agents, including, but not touted to TNF-ct, " ' 

mediating cytotoxicity or amigen-dependen, cytotoxtctty (ADCC). Thus, brea,. 

^^ A n^y..^»y^.co. to x ta ,.oc^.-n^°' 

otherwise provide means to loeaUy ablate cells. Sector 
h another preferred embodiment, the antibody rs conjugated to an effector 
noi ely neeffeetormoie^canbeanynumberofmolecu.es.includinglaheuingmore^ 

pro «ei, manodreraapecdteurerapeudemoievmodnla^naeactivityofmoleculea 

«, inhibit enzymatic activity such as protease or coUaget^se or protein lonase ac«m.y 

associated with breast cancer. 

In a preferred embodiment, the therapeutic moiety can also he a cytotoxrc 
^.muusmedrod.targeuug.hecytotoxicagen.tobreas, cancer tissue or ceUs results tn 
.^onm^numlKrof^c^.uaereby^aymp^ass^w.* 
hreastcanee, Cytotoxic agen* are numerous and varied and luclude, bu, are no, hmrted to, 
cy.otoxiedrugsor^naoractive^agmeutsofsuchu.xms. Suitable toxms and tbeu 

ch^ curcm.cro.in.phenomycmenomycinandmehlte. Cytotoxic agents also mclude 
l5 radiochemicals made by cunjugatog radioisotopes to antibodies raised against breas, cancer 

Jcled arc, but also serves to reduce deleterious side effects that may be associated vnttr 
30 the therapeutic moiety. 
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In another preferred embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind*' herein is meant that the antibodies bind to the protein with a 
10 K<j of at least about 0.1 mM, more usually at least about 1 |iM, preferably at least about 0.1 
liM or better, and most preferably, 0.01 \M or better. Selectivity of binding is also 
important. 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

15 In one aspect, the RNA expression levels of genes are determined for different 

cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 

20 essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 

25 obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 

30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
5 detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 

10 standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 

1 5 about 50%, more preferably at least about 100%, more preferably at least about 1 50%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 

20 final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test. 

25 In a preferred embodiment, gene expression monitoring is performed 

simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of 
5 particular interest are methods wherein an mRNA encoding a breast cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 

10 examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 

1 5 washing to remove the non-specifically bound probe, the label is detected. For example a 
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

20 In a preferred embodiment, various proteins from the three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
5 detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the breast cancer protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 

1 0 to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 

15 preferred embodiment each one of multiple primary antibodies contains a distinct and 

detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 

20 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 

25 Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 

30 nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
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breast cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
5 those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis 

10 assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PCR method may provide more 

15 sensitive and accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 
20 antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
25 agent (e.g., Zlokarnik, et al, Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
30 function of a breast cancer protein. As above, this can be done on an individual gene level or 
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by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 
5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level That is, having identified a particular gene as up regulated in breast cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene 

1 0 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast 

1 5 cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard 

20 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 
25 In this embodiment, the breast cancer nucleic acid probes are attached to 

biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 

10 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induced a breast cancer phenotype. 

15 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

Drug candidates encompass numerous chemical classes, though typically they 
20 are organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred 
30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a breast cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating- the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 

1 0 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 

15 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al t J. Med. Chem. 37(9): 1233-1251 (1994)). 

25 Preparation and screening of combinatorial chemical libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries {see, e.g., U.S. Patent No. 5,010,175, Furka, Pept. Prot. Res. 37:487-493 
(1991), Houghton et aL, Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
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hydantoins, benzodiazepines and dipeptides (Hobbs et al, Proc. Nat. Acad, Sci. USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al y J. Amer. Chem. Soc. 
1 14:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann et al, J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses 
5 of small compound libraries (Chen et al 9 J. Amer. Chem. Soc. 1 16:2661 (1994)), 

oligocarbamates (Cho, etal, Science 261:1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al, J. Org. Chem. 59:658 (1994)). See, generally, Gordon etal, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries {see, e.g., Strategene, Corp.), peptide nucleic acid 
libraries {see, e.g., U.S. Patent 5,539,083), antibody libraries {see, e.g., Vaughn et al, Nature 

10 Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries {see, 
e.g., Liang et al, Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries {see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. , 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 

15 U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available {see, e.g, 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

20 A number of well known robotic systems have also been developed for 

solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate H, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 

25 synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available {see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 
5 inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 

10 e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 

15 {see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 

20 as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 

25 proteins or fragments of naturally occurring proteins. Thus, e.g. 9 cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
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preferred Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
5 to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 

10 may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 

15 preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 

20 nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. ' 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
25 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemilumine§cent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

5.594.117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 

5.594.118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex, 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
altering a step parameter that is a thermodynamic variable, including, but not limited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
10 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
15 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the breast cancer phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene from normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 
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modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 
5 In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 

10 "administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 

15 the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
20 generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
25 the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
30 differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as '"breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

10 further described herein. 

Preferably, the breast cancer modulatory protein is a fragment of 
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred 
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 

15 C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or the 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compounds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compound on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator 
5 and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELIS A and the like with an antibody that selectively binds to the breast cancer polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 

10 hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 

1 5 protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or 0-gai. The reporter construct is typically transfected into a cell. After 
treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 

20 individual genes and gene products (proteins). That is, having identified a particular 

differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins." 
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a 

25 fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
cancer protein and a candidate compound, and determining the binding of the compound to 

10 the breast cancer protein. Preferred embodiments utilize the human breast cancer protein, 
although other mammalian proteins may also be used, e.g. for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
breast cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the breast cancer 

15 protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 

20 suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 

25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 

30 the protein or agent, excess unbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
5 support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 

10 protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 

1 5 fluorescent label), washing oflF excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., l25 I for the proteins and a fluorophor 

20 for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 

25 etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 

30 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
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screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 
1 0 label on the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the breast cancer protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
1 5 support, coupled with a lack of competitor binding, may indicate that the test compound is 
capable of binding to the breast cancer protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activity of the breast cancer proteins. In 
this embodiment, the methods comprise combining a breast cancer protein and a competitor 
20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
25 agent is capable of binding to the breast cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational drug design 
to synthesize agents that interact with that site. Drug candidates that affect the activity of a 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 
reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
5 results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

10 A variety of other reagents may be included in the screening assays. These 

include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 

15 of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 

20 nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 

25 chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 

30 structural feature of the compound. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided. The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided. The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
5 or individuals with breast cancer are provided. The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
1 0 those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
1 5 such as semi-solid or soft agar. The transformed cells, when transfected with tumor 

suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
20 the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 rd ed., 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et al (1996), 
25 " supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
30 continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 

78 



# 



WO 02/059377 PCT/US02/02242 



higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation 
density in non-limiting medium conditions. The percentage of cells labeling with ( 3 H)- 
thymidine is determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal 
counterparts {see, e.g., Temin, J. Natl. Cancer Instl 37:167-175 (1966); Eagle et al, J. Exp. 
Med. 131 :836-879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells {see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth. 
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol. (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless etal. , 1 Biol. Chem. 249:4295-4305 (1974); 
Strickland & Beers, J. Biol. Chem. 251 :5694-5702 (1976); Whur et al, Br. J. Cancer 42:305- 
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312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130 (1985). 

5 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
10 assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Fresliney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent Penetration into the gel, or through to the distal side of 
1 5 the filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with l25 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

20 Effects of breast cancer-associated sequences on cell growth can be tested in 

transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 

25 Such mice can also be made by substituting the endogenous breast cancer gene with a 

mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens, 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 

30 is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion [see, e.g., Capecchi et al, Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et al, Manipulating the Mouse Embryo: A Laboratory Manual, 
5 Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A . 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse {see, e.g., Giovanella et al, J. 
Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 

10 mouse (see, e.g, Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al.,Br. J. Cancer 
41 :52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10 6 cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 

1 5 length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
largest dimensions) and compared to the control. Tumors that have statistically significant 
reduction (using, e.g., Student's T test) are said to have inhibited growth. 

Polynucleotide modulators of breast cancer 

20 Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof. 

25 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
submits or their close homologs. Antisense polynucleotides may also have altered sugar 
30 moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 

10 binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof. 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 

15 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 and 
van der Krol et al (BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

25 ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al, Adv. in 

Pharmacology 25: 289-3 17 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al, 
Nucl. Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., 
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WO 94/26877; Ojwang et aL, Proc. Natl Acad. Set USA 90:6340-6344 (1993); Yamada et 
aL, Human Gene Tlierapy 1:39-45 (1994); Leavitt et al 9 Proa Natl Acad. Sci USA 92:699- 
703 (1995); Leavitt et aL, Human Gene Therapy 5:1151-120 (1994); and Yamada etal, 
Virology 205: 121-126 (1994)). 
5 Polynucleotide modulators of breast cancer may be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 

10 not substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

15 understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 

20 breast cancer protein. Alternatively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by 

25 overexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene-therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Alternatively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of a breast cancer antisense nucleic acid. 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
5 the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
10 antibodies may be coupled to standard affinity chromatography columns and used to purify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein. 

Methods of identifying variant breast cancer-associated sequences 

1 5 Without being bound by theory, expression of various breast cancer sequences 

is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 

20 using any number of sequencing techniques. In a preferred embodiment, the invention 

provides methods of identifying the breast cancer genotype of an individual, e.g., detennining 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 

25 comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 

30 embodiment, the presence of a difference in the sequence between the breast cancer gene of 
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the patient and the known breast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome. 
5 In another preferred embodiment, the breast cancer genes are used as probes to 

determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the breast 
cancer gene locus. 

10 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof, is administered to a patient. By 'therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 

15 on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et aL, Pharmaceutical Dosage Forms and Drug Delivery; 
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3,1 992), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 

20 known in the art, adjustments for breast cancer degradation, systemic versus localized 

delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art. 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 

25 methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
30 preferably a primate, and in the most preferred embodiment the patient is human. 
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The administration of the breast cancer proteins and modulators thereof of the 
present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonaiy, vaginally, rectally, or intraocularly. In 
5 some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a breast 
cancer protein in a form suitable for administration to a patient. In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 

10 pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 

15 propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, pi-toluenesulfonic acid, salicylic acid and the like. 
, "Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 

20 manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, ciiethylamine, triethylamine, 

25 tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 



86 



WO 02/059377 



• 

PCT/US02/02242 



The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 
5 antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecuie(s) in an appropriately resistant carrier, such as a liposome or a . 
protection barrier. Means of protecting agents from digestion are well known in the art. 

1 0 The compositions for administration will commonly comprise a breast cancer 

protein modulator dissolved in a pharmaceutical^ acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 

15 contain pharmaceutical^ acceptable auxiliary substances as required to approximate 

physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 

20 the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington 's Pharmaceutical Science (15th ed., 1980) and Goodman & Gillman, 
The Pharmacologial Basis of Therapeutics (Hardman et a/.,eds., 1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0. 1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 

25 patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g. 9 Remington 's Pharmaceutical Science and Goodman and Gillman, The 

30 Pharmacologial Basis of Therapeutics, supra. 
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The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
5 amount adequate to accomplish this is defined as a 'therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 

10 effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 

1 5 treatments may be used, eg., in a mammal who has previously had cancer to prevent a 

recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compounds can be administered alone or in combination with additional breast cancer 

20 modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
25 methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell {see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
5 volume 152 (Berger), Ausubel et aL, eds., Current Protocols (supplemented through 1999), 
and Sambrook et aL, Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 



administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the fiill-length sequence, partial sequences, or regulatory 
1 0 sequences of the breast cancer coding regions) can be administered in a gene therapy 
application. These breast cancer genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 



15 vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 

compositions can include, e.g., lipidated peptides (see, e.g.,Vitiello, A. et al.,J. Clin. Invest. 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) (*TLG") 
microspheres (see, e.g., Eldridge, et al. 9 Molec. Immunol. 28:287-294, (1991); Alonso et aL, 
Vaccine 12:299-306 (1994); Jones et aL, Vaccine 13:675-681 (1995)), peptide compositions 

20 contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et aL, Nature 
344:873-875 (1990); Hu et aL, Clin Exp Immunol. 113:235-243 (1998)), multiple antigen 
peptide systems (MAPs) (see, e.g., Tarn, Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413 (1988); 
Tarn, J. Immunol. Methods 196:17-32 (1996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 

25 vectors (Perkus, et aL, In: Concepts in vaccine development (Kaufinann, ed., p. 379, 1996); 
Chakrabarti, et aL, Nature 320:535 (1986); Hu et aL, Nature 320:537 (1986); Kieny, et aL, 
AIDS Bio/Technology 4:790 (1986); Top et aL, J. Infect. Dis. 124:148 (1971); Chanda et aL, 
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et aL, J. 
Immunol. Methods. 192:25 (1996); Eldridge etal., Sem. Hematol. 30:16 (1993); Falo etaL, 

30 Nature Med. 7:649 (1995)), adjuvants (Warren et aL, Annu. Rev. Immunol. 4:369 (1986); 



In a preferred embodiment, breast cancer proteins and modulators are 



Breast cancer polypeptides and polynucleotides can also be administered as 
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Gupta et al, Vaccine 1 1 :293 (1993)), liposomes (Reddy et al, J. Immunol 148: 1585 (1992); 
Rock, Immunol Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et al, 
Science 259:1745 (1993); Robinson et al, Vaccine 11:957 (1993); Shiver et al, In: Concepts 
in vaccine development (Kaufinann, e&, p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
5 Immunol 12:923 (1994) and Eldridge et al, Sem. Hematol 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 

10 or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

15 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acid compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff eL al, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
{see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
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vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
5 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al, Nature 351 :456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associated virus vectors, 
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
will be apparent to those skilled in the art from the description herein {see, e.g., Shata et al, 

10 MolMed Today 6:66-71 (2000); Shedlock et al, JLeukoc Biol 68:793-806 (2000); Hipp et 
aljn Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a breast cancer gene or portion of a breast cancer gene under the control of a 
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient. 

1 5 The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins, 
but more preferably encodes portions of the breast cancer proteins including peptides derived 
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer-associated genes or sequence encoding subfragments of a breast 

20 cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 

25 adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 

30 diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer find use in screening for modulators of a breast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
5 targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue-specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
10 protein. Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

15 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 

20 acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 

ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceuticaily acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 

25. (i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
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like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 
5 materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
1 0 the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome data. 



15 EXAMPLES 

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using T RIzol Reagent 

20 The sample weight is first estimated. The tissue samples are homogenized in 

1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3 100). The size of 
the generator/probe used depends upon the sample amount A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 

25 tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues . should be kept frozen until homogenized. The TRIzol is added 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a 

30 Sorvail superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-70°C for at least one month or else continue with the purification. 

The next process is phase separation. The homogenized samples are incubated 
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml of TRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 
2-3 minutes and next centrifiiged at 6500 rpm in a Sorvall superspeed for 30 min. at 4oC. 

The next process is RNA Precipitation. The aqueous phase is transferred to a 
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
10 0.5 ml of isopropyl alcohol is added per 1ml of TRIzol reagent used in the original 

homogenization. Then, the tubes are securely capped and inverted to mix. The samples are 
then incubated at room temp, for 10 minutes an centrifiiged at 6500 rpm in Sorvall for 20 

min.at4°C. 

The RNA is then washed. The supernatant is poured off and the pellet washed 
1 5 with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in 
the initial homogenization. The tubes are capped securely and inverted several times to 
loosen pellet without vortexing . They are next centrifiiged at <8000 rpm (<7500 x g) for 5 
minutes at 4°C. 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H2O. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 

25 such as Qiagen* s RNeasy kit. The poly A + mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37°C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 65°C. The the total RNA is mixed with DEPC-treated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 

incubated for 3 minutes at 65°C and 10 minutes at room temperature. 

The preparation is centrifiiged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet. A little bit 
5 of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until 

satisfactory binding and elution of poly A + mRNA has been found. 

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted 
onto the spin column and centrifiiged at full speed (soft setting if possible) for 1 minute. 

Next, the spin column is transferred to a new collection tube and gently 
1 0 resuspended in Wash Buffer OW2 and centrifiiged as described herein. 

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul 

of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up 
and down. The centrifugation is repeated as above and the elution repeated with fresh elution 
buffer or first eluate to keep the elution volume low. 

1 5 The absorbance is next read to determine the yield, using diluted Elution • 

Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 
proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA. 

20 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. of cold 100% ethanol is added and the preparation 
precipitated at -20°C 1 hour to overnight (or 20-30 min. at -70°C), and centrifiiged at 
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 
ethanol (-20°C) and then centrifiiged at 14,000-16,000 x g for 5 minutes at room temperature. 
The80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried 

25 without use of a speed vacuum and the pellet is then resuspended in DEPC H 2 0 at lug/ul 
concentration. 

Alternatively the RNA mav be purified using other methods (e.g.. Oiagen's RNeasv kitt. 
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No more than 100 ug is added to the RNeasy column. The sample volume is 
adjusted to 1 00 ul with RNase-free water 350 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin column for centrifugation (15 sec at >10,000 rpm). If yield is low, reapply 
5 the flowthrough to the column and centrifuge again. 

Then, transfer column to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifuge for 15 sec at >10,000rpm. The flowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 15 sec at >l 0,000 rpm. The 
flowthrough is discarded, and the column membrane dried by centrifiiging for 2 min at 
10 maximum speed. The column is transferred to a new 1.5-ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifiiged 
for 1 min at >1 0,000 rpm and the elution step repeated. 

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 

15 

First Strand cDNA Synthesis 

The first strand can be make using using Gibco's "Superscript Choice System 
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA-f 
mRNAl. For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 
20 Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA 
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol 

T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X 1 st Strand 
. Buffer, 2 ul of 0.1M DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1 
25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 

ul DEPC H 2 0; 30 ul 5X 2nd Strand Buffer, 3 ul lOmM dNTP mix; 1 ul 10 U/ul E.coli DNA 

30 Ligase; 4 ul 10 U/ul E.coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2 

96 



WO 02/059377 PCT/US02/02242 



hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add lOulof 0.5M 
EDTA. 

Cleaning up cDNA 

5 The cDNA is purified using Phenol:Chlorofonn:Isoamyl Alcohol (25:24:1) 

and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol:chloroform:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 
10 transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2.5X 
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20 
min, maximum speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

15 

In vitro Transcription (IVT) and labeling with biotin 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul 
T7 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 lOxCTP (75 

20 mM) (Ambion); 1 .5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-1 1-UTP 

(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volume is 
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol 

25 handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volume 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 

30 the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
5 of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50 ng/ul final cone); 50 pM 948-b control 
1 0 oligo; 1 .5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0. 1 mg/ml herring sperm DNA; 
0.5 mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

The hybridization reaction is conducted with non-biotinylated IVT (purified 
by RNeasy columns) (see example 1 for steps from tissue to IVT): The following mixture is 
prepared: 

1 5 IVT antisense RNA; 4 *ig: ul 

Random Hexamers (1 \ig/\il): 4 ^1 
H 2 0: 

14 jal 

Incubate the above 14 ul mixture at 70°C for 10 min.; then put on ice. 
20 The Reverse transcription procedure uses the following mixture: 

0.1MDTT: 3^1 

SOX dNTP mix: 0.6 ul 

H 2 0: 2.4 pi 

Cy3 or Cy5 dUTP (ImM): 3 ^1 
25 SS RT II (BRL): I \il 



16 ul 

The above solution is added to the hybridization reaction and incubated for 30 min., 42°C. 

Then, 1 jil SSII is added and incubated for another hour before being placed on ice. 
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The SOX dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM 



of dTTP and is made by adding 25 \il each of lOOmM dATP, dCTP, and dGTP; 10 |al of 
100mMdTTPtol5 jilH 2 0.] 

RNA degradation is performed as follows. Add 86 \il H20, 1 .5 jxl 1M NaOH/ 
5 2 mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 jj.1 TE/sample spin at 7000 g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500 pi buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95°C to denature the DNAse. 



Sample preparation 

For sample preparation, add Cot-1 DNA, 10 jxl; SOX dNTPs, 1 20X SSC, 2.3 \xl; Na pyro 
phosphate, 7.5 fxl; 10 mg/ml Herring sperm DNA; 1 ul of 1/10 dilution to 21.8 final vol. Dry 
in speed vac. Resuspendin 15 plH 2 0. Add 0.38 jal 10% SDS. Heat 95°C, 2 min and slow 
15 cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC+0.75ml 10% SDS in 
250ml H 2 0; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H 2 0; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H2O. Dry slides and scan at appropiate PMTs and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

5 Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



10 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unigene number 

Unigene Title: Unigene gene title 

15 R1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigenelO UnigeneTitle 

20 100472 D90084 Hs.1023 pyruvate dehydrogenase (Rpoamlde) alpha 

100499 T51986 Hs.283108 hemoglobin, gamma G 
100545 M55405 gb:Homo sapiens mucin (MUC-3) mRNA, part 

100549 BE142019 Hs.222056 Homo sapiens cDNA FU11572 fis, done HE 

100613 X52078 Hs.101047 transcription factor 3 (E2A Immunoglobul 

25 100635 BE259039 Hs.129953 Ewing sarcoma breakpoint region 1 

100645 X16841 Hs.167988 neural eel! adhesion molecule 1 
100654 A03758 NM_000477*:Homo sapiens albumin (ALB), m 

100702 L27065 gb:Human neurofibromatosis 2 (NF2) mRNA, 

100915 M60832 Hs.249239 collagen, type VIII, alpha 2 

30 100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 

101125 AJ250562 Hs.82749 transmembrane 4 superfarrury member 2 

101166 M90424 Hs.2099 Bpocalln 1 (protein migrating faster th 

101184 NMJ501674 Hs.460 activating transcription factor 3 

101336 NrvL006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 

35 101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 
101447 M21305 gb:Human alpha satellite and satellite 3 

101461 N98569 Hs76422 phosphatase A2, group HA (platelets. 

101511 M27826 Hs.267319 endogenous retroviral protease 

101634 AV650262 Hs.75765 GR02 oncogene 

40 101736 M74447 Hs.502 transporter^ ATP-binding cassette, sub 
102208 U22961 gb:Human mRNA clone with similarity to L 

102297 Nftl.001504 Hs.198252 G protein-coupled receptor 9 

102450 U48251 Hs.75871 protein kinase C binding protein 1 

102515 U89337 Hs.169886 tenascinXB 

45 102571 U60115 Hs.239069 four and a half UM domains 1 

102800 AA313538 gb:EST185419 Colon carcinoma (HCC) ceD 

102857 NMJ006744 Hs.76461 retinol-binding protein 4. interstitial 

102990 AA829286 Hs.332053 serum amyloid A1 

103434 X98085 Hs.54433 tenascin R (restrictin, janusin) 

50 103747 AA081995 gb.*zn26d06.r1 Stratagene neuroepithelium 

103750 AA126129 gbrzm78c07.rl Stratagene neuroepithelium 

103812 AA137107 Hs.326391 Homo sapiens, done MGC:16636, mRNA, com 

103851 AA326216 Hs.8719 hypothetical protein MGC1 136 

104080 AB041036 Hs.57771 kailikretn 1 1 (KLK11; TLSP; PRSS20; hipp 

55 104093 R50727 Hs.336970 ESTs 

104106 AA422123 gb:zv26h1Zr1 Scares JihHMPu_S1 Homo sapi 

.104109 AL353957 Hs.284181 hypothetical protein DKFZp434PG531 

104250 F06638 Hs.12440 Homo sapiens done 24734 mRNA sequence 
104340 AA426189 gb:zw1 1e09.r! SoaresJihHMPu_.S1 Homo sapl 

60 104492 N73185 Hs.94285 EST 

104506 N91071 Hs.109650 ESTs 

104511 N99542 Hs.572 orosomucoid 1 

104532 A1498763 Hs.203013 hypothetical protein FU12746 

100 
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104536 R24024 Hs.158101 Homo sapiens cDNA FU14673 lis, done NT 5 

104572 Y11312 Hs.132463 phosphoinositide-3-kinase t dass 2, beta 5 

104659 AW969769 Hs.105201 ESTs 5 

104677 AA009764 Hs.190380 ESTs 10 

5 104711 AA017245 Hs.32794 ESTs 10 

104731 AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene 10 

104764 AI039243 Hs.278585 ESTs 5 

105005 A1298208 Hs.28605 ESTs 10 

105036 AA130390 Hs.25549 hypothetical protein FU20898 10 

10 105105 R61532 Hs.87016 hypothetical protein RJ22938 5 

105231 AW970043 Hs.238039 hypofoetical protein FU1 1090 5 

105239 AA221036 gb:zr03M2.r1 Stratagene NT2 neuronal pr 10 

105921 AA421973 Hs.169119 ESTs. Weakly similar to T25731 hypotheti 5 

105957 BE242857 Hs.27021 hypothetical protein FU11 159 5 

15 106052 N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti 10 

106119 AL359624 Hs.11387 KIAA1453 protein 5 

106181 A1803651 Hs.191608 ESTs 10 

106194 AW976171 Hs.286194 hypothe0cal protein FU22233 5 

106283 A1085846 Hs.25522 KIAA1808 protein 10 

20 106379 AL042069 Hs.1 19021 DKFZP434N061 protein 10 

106451 AW235928 Hs,313182 ESTs 10 

106491 AA135688 Hs.10083 Homo sapiens, done IMAGE:41 39786, mRNA, 10 

106700 AA906434 Hs.3776 zinc finger protein 216 5 

106782 AW054886 Hs.25682 Homo sapiens mRNA tor K1AA1863 protein, 10 

25 106851 AI458623 gb:tk04g09jc1 NCLCGAPJLU24 Homo sapiens 5 

106870 AI983730 Hs.26530 serum deprivation response (phosphatidyl 5 

106892 AI347578 Hs.124015 hypothetical protein MGC26Q5 5 

106954 AF128847 Hs.204038 mdolethytorruneN<methyttransferase 5 

106991 AJ223811 Hs.30127 bypofiietical protein 5 

30 107103 A1446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY 5 

107124 AB006532 Hs.31442 RecQ protein-like 4 10 

107148 AI005036 Hs.334305 GS1999full 10 

107214 AF127026 Hs.5394 myosin IA 10 

107242 AB020672 Hs.175411 WAA0865 protein 10 

35 107331 AI905985 Hs.111805 ESTs 10 

107351 U51704 Hs.323428 ESTs, Moderately similar to ALU8_HUMAN A 5 

107423 W26652 Hs.6163 PTEN induced putative kinase 1 5 

107447 W28516 Hs.19210 hypothetical protein MGC1 1308 . 10 

107451 AL042425 Hs.283976 hypthettoal protein PR02389 10 

40 107453 AI092790 Hs.334703 hypoftefical protein FU14529 5 

107459 W38002 Empirically selected from AFFX single pr 10 

107683 N53167 Hs.47623 ESTs 10 

107711 W96141 Hs.220687 ESTs 10 

107754 AA017462 Hs.269244 ESTs 10 

45 107757 BE621721 Hs.280792 hypofoetical protein FU12387 similarto 10 

107864 AA025060 Hs.61246 ESTs * 10 

107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha 10 

107888 AA025836 Hs.191637 ESTs 5 

107997 AL049176 Hs.82223 chordin-fike 10 

50 108056 AA043675 Hs.62633 ESTs 10 

108081 AA093668 Hs.28578 musdeblind (DrosophflaHike 5 

108113 AA012881 Hs.72531 hypothetical protein FU 11838 10 

108238 AA059473 Hs.66783 EST 10 

108257 AA677927 Hs.144269 ESTs 5 

55 108335 AA070500 gb2m70h03^1 Stratagene neuroepitheiium 5 

108351 AA071193 gb*^f79b1Zs1 Soares_pineaLglarttl-N3HPG 10 

108382 NM.006770 Hs.67726 macrophage receptor with collagenous str 5 

108392 AA075124 gb:zm86a01.s1 Stratagene ovarian cancer 10 

108441 AA079079 gb:zm97c09.s1 Stratagene colon HT29 (937 10 

60 108446 AA085383 gb:zn13g03.s1 Stratagene hNT neuron (937 10 

108497 AA074897 gb:zm85a05.r1 Stratagene ovarian cancer 10 

108604 AA934589 Hs.49696 ESTs 5 

108662 AF117646 Hs.156637 Cas-Br-M (murine) ectroplc retroviral tr 5 

108706 AA121820 Hs.74569 K1AA0842 protein 10 

65 108738 AA126583 Hs.158725 ESTs 10 

108827 AI273692 Hs.1 10470 ESTs 10 

109123 AI028376 Hs.73232 ESTs 10 
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109389 AA101325 Hs.86154 hypothetical protein RJ12457 10 

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 5 

109919 R40604 Hs.129539 ESTs, Weakly similar to MCAT_HUMAN MITOC 10 

110008 AJ094674 Hs.30524 ring finger protein 24 10 

5 110141 H46749 Hs.31540 ESTs 10 

110354 W22165 Hs.22586 ESTs 5 

110433 AW294162 Hs.301062 UOP^cety^lpra^galactosamine:poiyp 10 

110448 H51276 Hs.13526 hypofoefical protein FU12688 - 10 

110455 H52576 gkyt85e08.r1 Soaresj>inealgtandJI3HPG 5 

10 110540 H72639 Hs.167608 ESTs 5 

110553 H60593 Hs.124990 ESTs 10 

110976 AL044174 Hs.159526 patched (Drosophiia) homolog 10 

110987 Af753316 Hs.26034 ESTs 5 

111158 N66616 Hs.138629 H^apiens mRNA for subteiomertc repeats 5 

15 111168 AI798376 gb.1r34bG7.x1 NCLCGAP_Ov23 Homo sapiens 10 

111187 AJ224864 Hs.9688 teuiaxyte membrane antigen 5 

111307 AA641636 Hs.37477 ESTs, Weakly similar to T46908 hypothefi 5 

111400 R00144 Hs.189771 ESTs 10 

111498 AI168511 gb»w9Qh09.s1 SoaresJetaUiver_spteen_ 10 

20 111651 R16733 Hs.20499 ESTs 10 

111738 R26065 gbyh39d03.s1 Scares placenta Nb2HP Homo 5 

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5JHUMAN A 10 

111995 R42333 Hs.302292 ESTs ' 10 

112071 AL117490 Hs.47225 Ras-asscdated protein Rap1 10 

25 112204 NM_006668 Hs.25121 cytochrome P450. subfamily 46 (cholester 10 

112258 R51889 Hs.24990 ESTs 5 

112490 R31094 Hs^4378 ESTs 10 

112588 R77302 gb:yi75h08.s1 Scares placenta Nb2HP Homo 10 

112654 BE618629 Hs.268809 ESTs 5 

30 112784 T98628 Hs.191290 ESTs 5 

112817 AI057205 Hs.14584 ESTs 5 

112885 AA581428 Hs.5021 EST 10 

112913 T16837 Hs.4241 ESTs 5 

1 13149 T51588 gb:yb27e06.s1 Stratagene fetal spleen (9 10 

35 113174 T54659 Hs.301 755 Homo sapiens cDNARJ 11 465 lis, done HE 5 

113203 AA743563 Hs.10305 ESTs 5 

113299 AW207424 Hs.332594 ESTs 10 

113367 N92359 Hs.14518 ESTs, Moderately similar to A48752 B-cel 10 

113457 R16763 Hs.268679 ESTs 5 

40 113563 AA913635 Hs,326413 Homo sapiens cONA FU20812 lis, clone AD 10 

113574 R06874 Hs.268628 ESTs, Moderately similar to ALU1_HUMAN A 5 

113776 AI791905 Hs.95549 hypothetical protein 10 

113790 AI244311 Hs.26912 ESTs 10 

113807 W07586 Hs.8045 ESTs 3 

45 113958 W86195 gb:zh54e05.s1 Soares_fetaLfiver_spteen_ ♦ 10 

114211 Z39319 Hs.27347 EST ' 10 

114254 AB0 18263 Hs.180338 tumor necrosis factor receptor superfami 5 

114349 AA745976 Hs.28273 ESTs 5 

114449 AA020736 gb:ze63b11.s1 Soares retina N2MHR Homo 5 

50 114484 AA034378 Hs.267319 endogenous retroviral protease 5 

114576 AA065096 gb:zm50a02.s1 Stratagene fibroblast (937 5 

114624 AA081507 gb:znQ5b10.r1 Stratagene hNT neuron (937 5 

114844 AA234826 Hs.87386 EST 5 

114906 AA234462 Hs.87350 ESTs 5 

55 115624 AK000725 Hs.50579 hypothetical protein FU2071 8 . 3 

115666 AF173081 Hs.178215 Vertebrate UN7 homolog 1, Tax interact! 5 

115712 AB020649 Hs.74569 WAA0842 protein 5 

115889 AA398841 Hs.39850 hypothetical protein FU20517 10 

115949 AI478427 Hs.43125 esophageal cancer related gene 4 protein 10 

60 116107 AL133916 Hs.172572 hypothetical protein RJ20093 10 

116180 AA463902 Hs.13522 ESTs, Weakly similar to 138022 hypotheti 5 

116267 AW9687Q3 Hs.30085 hypothetical protein RJ231 86 5 

116291 AW410377 Hs.41502 hypothetical protein FU21276 5 

116527 AW194253 Hs.68607 ESTs 10 

65 116659 BE314852 Hs.168694 Homo sapiens done 23763 unknown mRNA, p 5 

116708 F10526 Hs.70001 ESTs, Moderately simflar to JC61 69 nude 5 

117058 AW801808 p>:ll34JM007W1040(W)62^7UM0070 r^ 5 
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117151 AI803656 Hs.42373 ESTs 5 

117226 N20468 gb:yx39b10.s1 Soares melanocyte 2NbHM Ho 10 

117323 AI472863 Hs.43387 ESTs 5 

117571 N34417 Hs,44584 ESTs 3 

5 117624 N26627 H&82364 ESTs, Weakly similar to JC4124 pregnancy 5 

117673 N40551 Hs.t84Q43 Homo sapiens Ets-1 binding protein (E1B) 10 

117847 N49285 Hs.182391 ESTs 10 

117877 AW263476 Hs.44268 myeHn gene expression factor 2 10 

117919 BE222341 Hs.279472 ESTs 5 

10 118049 N53145 gb:yv55fl)9.s1 Soares fetal liver spleen 3 

118413 AW955696 Hs.90960 ESTs 10 

118613 AHJ78236 Hs.49688 ESTs 5 

118664 N70907 Hs.230619 EST 10 

118858 AL122040 Hs.102981 Homo sapiens mRNA; cDNA OKFZp434G1972 (f 3 

15 118902 AA993527 Hs.293907 hypothetical protein FU23403 5 

119039 AI160570 Hs.252097 pregnancy specific beta«1-glycoproteln 6 3 

119159 AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain 5 

119216 AA514422 Hs.221849 ESTs 5 

119317 AK002001 Hs.51305 v-maf musculoaponeurotic fibrosarcoma (a 10 

20 119366 T77892 gb:yd20ms1 Soares fetal fo/er spleen 5 

119378 T81824 Hs.90949 EST 5 

119528 W38051 Empirically selected from AFFX single pr 10 

119792 AL049798 Hs.80552 dermatopontin 3 

119800 AF086332 Hs!58314 ESTs 10 

25 119817 AF088061 Hs.159690 ESTs 5 

119835 AF086429 Hs.58429 ESTs 5 

119923 AW803308 -Hs.62954 ferritin, heavypolypeptlde 1 5 

119961 U34249 Hs.337461 Human putative zinc finger protein (ZNFB 5 

120379 AL042725 gbf>KFZp434B1822.r1 434 (synonym: htes3) 10 

30 120931 AW136934 Hs.97162 ESTs 5 

121037 AA907743 Hs.142373 ESTs 5 

121282 AA401695 Hs.97334 ESTs 5 

121382 AA405763 Hs.111939 Homo sapiens cDNA RJ20470 lis, clone KA 5 

121764 AA421452 Hs.164851 ESTs, Weakly similar to KIAA0926 protein 5 

35 122034 AK000229 Hs.98017 Homo sapiens cDMA FU20222 Ms, clone CO 10 

122441 AA447555 Hs.99116 EST 10 

122756 AA458945 Hs.95898 ESTs 10 

122771 AW135093 Hs.97282 ESTs, Highly similar to G100_HUMAN 1 10 K 5 

123601 AA609122 Hs.112645 Homo sapiens mRNA; cDNA DKFZp434D2472 (f 5 

40 123623 AI024595 Hs.97508 a disintegrin and metaltoprotetnase doma 5 

123941 AA621529 gb:af47a02.s1 SoaresJotalfetusJ^b2HF8_ 10 

124215 H62570 gb.7r44aOI.rl Soares fetal Ever spleen 5 

124276 H83465 gb:ys91a1 1.s1 Soares refinaN2b5HR Homo 5 

124680 AK001527 Hs.163953 hypothetical protein FU10665 5 

45 125099 NIVL014312 Hs.112377 cortical thymocyte receptor (Jtlaevis 10 

125121 T98199 Hs.48403 hypothetical protein RJ 10847 10 

125188 BE299567 Hs.271749 ESTs, Moderately similar to ALUS JiUMAN A 5 

125284 NMJ02666 Hs.103253 periBpIn 10 

125906 BE256206 Hs. 17775 p75NTR-associated cell death executor; 0 5 

50 128484 AA485421 Hsi70503 ESTs. Weakly similar to ALU7_HUMAN ALU S 10 

128511 NMJJ02250 Hs.10082 potassium intermediate/small conductance 10 

128538 R44214 Hs.101189 ESTs 5 

128606 C16161 Hs.283040 hypothetical protein PR02543 5 

128850 AA193106 Hs.18081 7 chromosome 11 open reading frame 23 10 

55 128870 H39537 HsJ5309 eukaryotic translation elongation factor 10 

128903 AW150717 Hs.296176 STAT induced STAT Inhfeitor 3 10 

128931 N62889 Hs.107242 Homo sapiens cONAFUl 2965 Rs, clone NT 10 

129001 AA443323 Hs.107812 BPOZ protein 5 

129091 AA056483 Hs.301463 Human Chromosome 16 BAC done CIT987SK-A 5 

60 129101 NM.013403 Hs.108665 zinedin 10 

129146 AL117472 Hs.108924 SH3-<Jomain protein 5 (ponsln) .5 

129213 AI146494 Hs.109525 ESTs, Weakly similar to IRX2_HUMAN IROQU 3 

129228 U40714 Hs.239307 tyrosyURNA synthetase 5 

129265 AA530892 Hs.171695 dual specificity phosphatase 1 5 

65 129285 BE617015 Hs.11006 ESTs, Moderately similar to T17372 plasm 10 

129346 AF110141 Hs.288908 WAS protein family, member 2 10 

129368 NMJJ03877 Hs. 1 10776 STAT induced STAT inhibitor-2 5 
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129371 X06828 Hs,110802 von Willebrand factor 5 

129381 AW2458G5 Hs.1 10903 claudin 5 (transmembrane protein deleted 10 

129440 W37944 Hs.4007 Sarcolemmakissoclated protein 5 

129441 BE061069 Hs.301943 WAA0467 protein 10 
5 129516 AF020038 Hs.11223 isocitrate dehydrogenase 1 (NADP+), solu 10 

129554 BE222078 Hs.113089 ESTs 10 

129684 BE622468 Hs.11924 ESTs, Weakfy similar to I38022 hypothetl 5 

129702 A1304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetl 5 

129778 AK001676 Hs.12457 hypothetical protein FU1081 4 10 

10 129893 AK000956 Hs.13209 hypothetical protein FU10094 5 

129928 AI338993 Hs.134535 ESTs 5 

129973 AJ251760 Hs.273385 guanine nucleotide binding protein (G pr 5 

129977 NA/L000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 5 

130014 NM_001158 Hs.143102 amine oxidase, copper containing 2 (reti 5 

15 130085 M62402 Hs.274313 insulin-Gke growth factor binding prate 10 

130089 AA452006 Hs.333199 ESTs 5 

130162 W80711 Hs.319946 Homo sapiens mRNA tor WAA1727 protein, 5 

130243 D88435 Hs.153227 cycOn G associated kinase 10 

130315 AI241084 Hs.154353 nonselective sodium potassium/proton exc 5 

20 130339 AA435746 gb2f79e03.s1 SoaresJesfis^NHT Homo sap 5 

130400 V00517 Hs.283108 hemoglobin, gamma G 10 

130436 NM.001928 Hs.155597 D component of complement (adipsin) 10 

130478 X72308 Hs.251526 small inducible cytokine A7 (monocyte ch 5 

130480 BE222978 Hs.15760 MYG1 protein 10 

25 130494 AW390834 Hs.75874 pregnancy-associated plasma protein A 5 

130563 BE270472 Hs.279900 HSPC015 protein 10 

130589 AL110226 Hs.16441 OKFZP434H204 protein 10 

130606 A1652143 Hs.288382 hypothetical protein HJ131 11 5 , 

130634 AI769067 Hs.127824 ESTs, Weakly similar to T28770 hypotheti 3 

30 130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA 10 

130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10 

130716 AA232075 Hs.18259 XPA binding protein 1; putative ATP(GTP) 5 

130718 AF263462 Hs.18376 KIAA1319 protein 10 

130722 N41322 Hs.18441 ESTs 5 

35 130798 M81349 Hs.1955 serum amyloid A4, constitutive 10 

130840 BE048821 Hs.20144 small Inducible cytokine subfamily A (Cy 10 

131184 AB040935 Hs.23954 cerebral ceil adhesion molecule 10 

131261 AA360419 Hs.171776 Inositol(myo)-1{or4)-monophosphatase1 10 

131282 X03350 Hs.4 alcohol dehydrogenase 1B (class 1). beta 10 

40 131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 10 

131340 AK000393 Hs.25817 BTB (POZ) domain containing 2 < 5 

131341 AF1 10908 Hs.297660 TNF receptor-associated factor 3 5 
131406 H83294 Hs.284122 Wnt inhibitory fector-1 5 
131489 BE394648 Hs.27414 hypothetical protein 5 

45 131543 AW966881 Hs.41639 programmed cell death 2 10 

131692 BE559681 Hs.30736 KIAA0124 protein 5 

131753 AA829286 Hs.332053 serum amyloid A1 10 

131756 AA443966 Hs.31$95 ESTs 10 

131785 H69342 Hs.26320 TRAB 10 protein 10 

50 131815 AA021258 Hs.32753 ESTs 5 

131819 BE244961 Hs.173103 FE65-UKE2 5 

131828 AJ000263 Hs.278658 keratin, hair, basic, 6 (monilethrix) 10 

131888 AW294659 Hs.34054 Homo sapiens cONA: FU22488 lis, done H 5 

131927 AJ003112 Hs.34780 doubtecortex; fesencephaly, X4mked (d 5 

55 131949 AK000010 Hs.258798 hypothetical protein FU20003 10 

132115 H81604 Hs.178471 KIAA0798 gene product 5 

132177 X80818 Hs.178078 glutamate receptor, metabotropic 4 5 

13236 AA467752 Hs.195161 ESTs 5 

132426 AW118072 Hs.89981 dlacylglycerol kinase, zeta (104kD) 10 

60 132477 S68874 Hs.170917 prostaglandin E receptor 3 (subtype EP3) • 5 

132675 AI291496 Hs.5476 Homo sapiens, clone IMAGE:3530123. mRNA, 10 

132796 NM-006283 Hs.173159 transforming, acidic coiled-cofl contain 10 

132898 W28548 Hs.224829 ESTs 10 

132905 NMJ304235 Hs.7934 KnippeWike factor 4 (gut) 10 

65 132953 BE175645 Hs.321264 LBP protein 32 5 

133116 BE563966 Hs.6529 ESTs, Weakly similar to 178885 serine/th 5 

133120 NM_003278 Hs,65424 tetranectin (plasminogen-blndlng protein 10 
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133139 AF052138 Hs.6580 Homo sapiens cDNA: FU23227 fis, done C 5 

133163 AA668224 Hs.6634 Homo sapiens cDNA: FU22547 fis, clone H 5 

133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2..HUMAN FORKH 5 

133272 NAL002776 Hs.69423 kaifikreln 10 (KLK10)(PRSSL1)(nes1) 5 

5 133379 AA207059 gb:zq80h09.s1 Stratagene hNT neuron (937 5 

133407 AF017987 Hs.7306 secreted frizzied-related protein 1 5 

133552 H21497 Hs.7471 BBP-Gke protein 1 5 

133702 L02321 Hs.75652 glutathione S-transferaseM5 5 

133719 H26904 Hs75726 apolfpoprotein 0 5 

10 133731 N71725 Hs.272572 hemoglobin, alpha 2 10 

133789 T85626 Hs.76239 hypothetical protein FU20608 5 

134007 AF072441 Hs.7B40 catdneurin binding protein 1 10 

134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 10 

134111 A1372588 Hs.8022 TU3A protein 10 

15 134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr 10 

134177 BE243319 HsJ9672 K1AA0652 gene product 5 

134308 AW905827 Hs.81454 ketohexoklnase (fructokinase) 10 

134361 BE549343 Hs.82208 acyt-Coenzyme A dehydrogenase, very long 5 

134369 AF207664 Hs.8230 a disintegrirvTike and metalloprotease ( 5 

20 134449 L34155 Hs.83450 famlnln, alpha 3 (nteein (150kD), kalini 5 

134467 A1190413 Hs.8373 ESTs 10 

134496 M64936 gb:Horno sapiens retinoicadd-inducible 10 

134510 NMJJQ2757 Hs.250870 mltogen-actjvated protein kinase kinase 10 

134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 10 

25 134577 BE244323 Hs.85951 exportin, tRNA (nudear export receptor 5 

134591 U73394 Hs.1 66085 killer cell immunogtobulin^Oke receptor 5 

134678 AL008583 Hs.182595 dyneh, axonemai, light polypeptide 4 5 

134728 D10216 Hs.89394 POU domain, dass 1, transcription facto 5 

134758 NMJ00078 Hs.89538 choiesteryi ester transfer protein, plas 10 

30 134786 T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous 10 

134912 T87521 Hs.261457 ESTs 5 

134963 NM.O03394 Hs.91985 wingless-type MMTV integration site faml 10 

134969 H22570 Hs.172572 hypothetical protein FU20093 5 

135001 AA302517 Hs.92732 KIAA1444 protein 5 

35 135066 X04430 Hs.93913 InterteuHn 6 (Interferon, beta 2) 10 

135173 AL036557 Hs.95910 putative lymphocyte G07G1 switch gene 10 

135197 U76456 Hs.190787 tissue inhibitor of metaltoproteinase 4 5 

135219 AB002361 Hs.96633 KIAA0363 protein 5 

135250 U83171 Hs.97203 small IndudWe cytokine subfamily A (Cy 5 

40 135304 AA416829 Hs.191597 ESTs 5 

135337 AA905406 Hs.9905 ESTs, Weakly similar to unnamed protein 3 

135417 X55019 Hs.99975 cholinergic receptor, nicotinic, delta p 10 

101367 X03350 Hs.4 alcohol dehydrogenase 1B (dass I), beta 5 • 

128870 H39537 Hs.75309 eukaryotic translation elongation factor 5 

45 129381 AW245805 Hs.110903 daudin 5 (transmembrane protein deleted 5 

130085 M62402 Hs.274313 tnsunn-iike growth factor binding prote 5 

130689 NM.006691 Hs.17917 extracellular link domainoontaining 1 10 

133120 NM003278 Hs.65424 tetranectin (piasminogen-binding protein 3 

133407 AF017987 Hs.7306 secreted frizzied-related protein 1 5 

50 133731 N71725 Hs.272572 hemoglobin, alpha 2 5 

134369 AF207664 Hs.8230 a oTsWegrirHike and metalloprotease ( 5 

135066 X04430 Hs.93913 Interteukin 6 pnterferon, beta 2) . 10 

135173 AL036557 Hs.95910 putative lymphocyte GQ/G1 switch gene 5 

322580 AK001852 Hs.274151 llgatin 5 

55 408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 10 

418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 5 

427458 BE208364 Hs.29283 ESTs, Weakly similar to LKHU proteoglycan link 5 

446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 10 

449826 UB5642 Hs.138506 ESTs 5 

60 RC_H15814_s HumanapM1niRfWforGS3109(noveladiposespecificcoliagen 10 

YEL024w/RIP1 EST - YEL024w/RIP1 3 
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Table 1 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene duster number 
Accession: Genbank accession numbers 



Pkey 

108446 
108497 



112224.1 
110079J2 



15 



20 



25 



124215 1597154.1 

30 117058 1219924.1 

110455 46874.1 

111168 38585.1 



35 



CAT Number Accessions 



40 



45 



111498 
104340 
103747 
134496 



411008.1 
46289.10 
117944 J 
46501.1 



AI684569AA257011 



103750 
105239 



118365.1 
34624.1 



50 120379 34624.3 



55 



60 



114624 
106851 
108392 
100545 
100654 

100702 
102208 



AA085383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978 AA075372 AA 128503 

AA074897 AA113914 AA084871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA078803 

AA148628 AA122204 AA074159 M126185 AA0791 17 AA127089 M07091 2 AA079280 AA131372 AA078833 AA0710B7 

AA076131 AA071047 AA079401 AA083070 AA102076 AA1 15163 AA074 1 98 AA1 34725 AA1 13889 M121 103 AA075041 

AA065148 AA071310 AA101 144 AA079659 AA078931 AA079209 AA070928 AA068994 AA069817 AA076187 AA069053 

M131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070053 AA126283 M126078 AA075895 

AA079208 AA074583 AA071086 AA079623 AA070627 AA078802 AA076622 AA065051 AA079143 AA071 1 10 AA079434 

AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA085118 AA079450 AA085044 AA120938 

AA079200 AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027 

AA115929 

H62570 H59063 

AW801806 H90434BE086530 

H52576AF085971H52172 

AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 

AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 M173345 AA199942 AA223384 

AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 

AA157718 AA157719 AA 100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 

C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 

H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 

AW817659 BE081531 H59570 

AI168511 AI022712AA700366 R07371 R07324 

AA426189 F15201 

AA081995AA101099 

M64936 A1025512 AI382987 BE061777 AA089966 BE169930 T41 176 AW594624 BE502415 AA121893 A1269283 T4031 1 

AI079277 AC41318 BE327710 AW975215 AW896268 AA884990 BE327514 
AA126129 AA 1 26033 AA082561 

AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA66B403 AA601627 
AW863839 BE061833 BE000620 AW981 170 AW84751 9 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 



11 1686 J 
322947.1 
113549.1 
22955.11 
Bgr.HT2969 

flgr.HT3413 
6735.9 



AL042725 BE083316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 AA223285 AA071122 

AA227849 AA216700 AI596002 AA101867 AA099426 AA135997 AL041698 T02815 T51824 AA207189 T59230 T51868 

AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856396 AW861859 AA053045 

AA081507 M070071 AA070840 AA084362 

AI458623 AA639708 AA485409 R22065 AA485570 

AA075124AA075208 

M55405AW752552 

A03758 A06977 A15293 017029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 

X51363 X51364 X51365 

L27065 

U22961 AA203623 AA503337 AM 74733 AI192802 C06092 AA035357 Al 1906 19 AJ 199244 AI828450 AA602296 AJ378195 
A1209170 A1186653 AI127795 AI183846 H77389 A1589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 A1383957 AA156873 AI333101 
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N70806 AI141254 A1383191 AI4Q1237 A1080709 AI093400 W84549 T90806 R00012 W01413 AA630557 A1378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 Al 132980 AI133214 A1064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 A1174993 AI114729 AI061645 A1064716 AI064959 H77388 
T85706 AF075298 A11 10799 D17107 NMJJ00477 AF190168 R50724 AI248416 AI207432 AM33684 A1133345 A1174710 
AI133290 AI133304 AI174948 AI207484 AI1 10717 AF074624 All 14515 AFQ63516 All 10642 A11 14559 AH 14498 AI114759 
AI207568 AI064960 A1174753 All 14666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AM 32925 A1064701 
AI174748 AI1 14663 AH33104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AJ091 136 AA334608 T5741 1 Z20979 
N56507 T87485 AI133622AA343370T40075 T69671 T53849T74820 AF075316 AI1 10818 T40121 T57381 AI114468 
AA332728 T51362 AI1 14589 R06691 AI1 10629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 
AI1 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 A1064838 AH33660 T60398 T88753 T55930 T92126 AI444602 T60996 AJ1 14792 H9391 1 AI133106 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 A1133017 AI064857 A11 10730 AF074637 AI207567 
H71080 773217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AM 14786 
T83564 AA341859 T81684 T55769 AM 14710 T51 776 AA343213 AI1 14714 T58102 AI110809 R28984 AI174854 AA305675 
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 757945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T7321 1 AI1 14590 T48317 T55965 T74857 R84226 T56552 T52231 774946 776976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AI192792 AI766421 AI064737 AW051713 AA936693 AI1331 17 AI766232 Ai913646 
T83962 AI065112 A1207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701 169 AI936354 AM 14720 AI433289 
AAO46980 AI823482 AM 14536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 
AA845518 AA719124 AA883454 T68850 769115 AI935509 AJ150977 762890 771374 T68294 AI174774 76741 1 T68318 
A1064689 T56624 T69010 768982 768302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 773330 
W52657 771984 T691 18 W92684 AH 14860 762093 T61797 AI522333 T73322 H92981 75601 8 T6181 1 757232 AI336158 
T61821 769457 762900 T62912 772917 T46885 AI702448 757212 T57203 R94581 T71311 T61819 789358 T67708 770918 
T59168 AI18711 1 764308 T62071 T69427 Al 114750 T60430 R09734 769033 T69141 769453 T67908 R16809 T69394 
A1207729 T55839 790273 773339 AW194909 T75486 771850 771305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 A1174983 751679 754851 H69880 N73734 AA443453 773466 H69672 N53869 T68447 D11809 D12412 T64300 
T28321 T55864 



123941 genbanKjAA621529 AA621529 
118049 genbanLN53145 N53145 



35 



102800 14782.20 AA313538 U88895 U88902 
104106 AA422123_LatAA422123J 



45 



40 



111738 genbanU*26065 R26065 

113149 genbankJT51588 T51588 

113958 genbankJV86195 W86195 

108335 genbanl<uAA070500 AA070500 

108351 genbankuAA071193 AA071193 

108441 genbankJ\A079079 AA079079 

124276 genbankj«3465 H83465 

101447 entre*_M21305 M21305 

117226 genbanLN20468 N20468 



133379 genbanLAA207059 AA207059.AA2Q7241 
119366 genbank_T77892 T77892 
119528 NOT^OUND.entre^WSSOSI W38051 



50 



112588 genbank_R77302 R77302 

114449 genbankJW>20736 AA020736 

114576 genbanK^AA065096 AA065096 

107459 W38002jsjrt W38002.S 



130339 genbank^AA435746 AA435746 



107 



WO 02/059377 



PCT/US02/02242 



TABLE 2: Figure 2 from BRCA 001 US 
S Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probesei Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 0 UnkjenelD: Unkjene number 

Unigene Tide: Unlgene gene tille 



R1: 



Ratio of normal breast tissue to tumor 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Pkey 


ExAccn 


UnigenelD 


100499 


T51986 


Hs.283108 


100549 


BE142019 


Hs.222056 


100654 


A03758 




100971 


BE379727 


Hs.83213 


101184 


NMJJ01674 


Hs.460 


101336 


NM_006732 


Hs.75678 


101367 


X03350 


Hs.4 


101447 


M21305 




101461 


N98569 


Hs.76422 


101511 


M27826 


Hs.267319 


101736 


M74447 


Hs.502 


102208 


U22961 




102450 


U48251 


Hs.75871 


102800 


AA313538 




102857 


NM_006744 


Hs.76461 


102990 


AA829286 


Hs.332053 


103747 


AA081995 




103812 


AA137107 


Hs.326391 


104093 


R50727 


Hs.336970 


104109 


AL353957 


Hs.284181 


104250 


F06638 


Hs.12440 


104492 


N73185 


Hs.94285 


104506 


N91071 


Hs.109650 


104532 


AI498763 


Hs.203013 


104677 


AA009764 


HS.190380 


104711 


AA017245 


Hs.32794 


104731 


AA019300 


Hs.125070 


105005 


AI298208 


Hs.28805 


105036 


AA1 30390 


Hs.25549 


105239 


AA221036 




106052 


N79885 


Hs.6382 


106181 


AI803651 


Hs.191608 


106283 


AI085846 


Hs.25522 


106379 


AL042069 


Hs.1 18021 


106451 


AW235928 


Hs.313182 


106491 


AA135688 


Hs.10083 


106782 


AW054886 


Hs.25682 


107124 


AB006532 


Hs.31442 


107148 


AI005036 


Hs.334305 


107214 


AF127026 


Hs.5394 


107242 


AB020672 


Hs.175411 


107331 


AI905985 


Hs.111805 


107447 


W28516 


Hs.19210 


107451 


AL042425 


Hs.283976 


107872 


BE271708 


Hs.95110 


108351 


AA071193 




109546 


F01449 


Hs.26954 


110433 


AW294162 


Hs.301062 


110976 


AL044174 


Hs.159526 



Unlgene Tittle R1 

hemoglobin, gamma G 10 

Homo sapiens cDNA FU11572 fc, clone HE 10 

NM.000477*:Homo sapiens albumin (ALB), m 10 

fatty add binding protein 4. adipocyte 10 

activating transcription factor 3 10 

FBJ murine osteosarcoma viral oncogene h 10 

alcohol dehydrogenase 1 B (class I), beta 10 

gb:Human alpha satellite and satellite 3 10 

phosphoBpase A2, group HA (platelets, 10 

endogenous retroviral protease 10 

transporter 2, ATP-binding cassette, sub-famO 10 

gfcHiiman mRNA done wimsiniflaj^ 10 

protein kinase C binding protein 1 10 

gb:EST185419 Colon carcinoma (HCC) ceil 10 

re tinol-binding protein 4, interstitial 1 0 

serum amyloid A1 10 

gbzn26d06.rl Stratagene neunoepfthelium 10 

Homo sapiens, clone MGC:16638, mRNA, com 10 

ESTs 10 

hypothetical protein DKFZp434P0531 10 

Homo sapiens clone 24734 mRNA sequence 10 

EST 10 

ESTs 10 

hypothetical protein RJ12748 10 

ESTs 10 

ESTs 10 

ESTs, Moderately similar to I54374 gene 1 0 

ESTs 10 

hypothetical protein FU20898 10 

gbzr03f12r1 Stratagene NT2 neuronal pr 10 

ESTs, Highly similar to T00391 hypofoeti 10 

ESTs 10 

KIAA1808 protein 10 

DKFZP434N0S1 protein 10 

ESTs 10 

Homo sapiens, done IMAGE:4139786, mRNA, 10 

Homo sapiens mRNA for KIAA1863 protBin, 10 

RecQ protein-like 4 10 

GS1999fuH 10 

myosin IA 10 

K1AA0865 protein 10 

ESTs 10 

hypothetical protein MGC1 1308 10 

hypthetal protein PR02389 10 

ESTs, WeaWy similar to A55943 1-phospha 10 

gbzf79b12.s1 Soares_pineaLglandJJ3HPG 10 
Homo sapiens mRNA; cDNA DKFZp762G123 (fr 10 

UDP^^tyklpha^alactosajnirie:polyp 10 

patched (Drosophaa) homolog 5 
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111168 AI798376 gb:tr34b07jc1 NCLCGAP.Ov23 Homo sapiens 10 

111651 R16733 Hs.20499 ESTs 10 
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALUS JiUMAN A 10 
114484 AA034378 Hs.267319 endogenous retroviral protease 10 

5 125284 NM002666 Hs.103253 perillpln 10 

128850 AA193106 Hs.180817 chromosome 11 open reading frame 23 5 

128903 AW150717 Hs.296176 STATfrxlucedSTATinhbaor3 10 

129346 AF110141 Hs.288908 WAS protein family, member 2 10 

129381 AW245805 Hs.1 10903 claudin 5 (transmembrane protein deleted 10 

10 129516 AFO20038 Hs.11223 fcocitrate dehydrogenase 1 (NADP+), solu 10 

129554 BE222078 Hs.113069 * ESTs 10 

130085 M62402 Hs.274313 Insulin-like growth factor binding prate 10 

130243 D88435 Hs.153227 cydin G associated kinase 10 

130400 V00517 Hs.283108 hemoglobin, gamma G 10 

15 130436 NlvL001928 Hs.155597 D component of complement (adipsin) 10 

130563 BE270472 Hs.279900 HSPC015 protein 10 

130589 AL110226 Hs.16441 DKF2P434H204 protein 10 

130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE3875012, mRNA 10 

130689 NMJJ06691 Hs.17917 extracellular Dnk comairKontalnlng 1 10 

20 130689 AA046747 Hs.17917 extracellular link domain-containing 1 10 

130718 N70196 Hs.18376 KIAA1319 protein 10 

130798 M81349 Hs.1955 serum amyloid A4, constitutive 10 

130840 BE048821 H&20144 small inducible cytokine subfamily A (Cy 10 

131184 AB040935 Hs23954 cerebral cell adhesion molecule 10 

25 131282 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10 

131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 10 

131543 AW966881 Hs.41639 programmed cell deafo 2 10 

131753 AA829286 Hs.332053 serum amyloid A1 10 

131785 H69342 Hs.26320 TRABID protein 10 

30 131828 AJ000263 Hs.278656 keratin, hair, basic, 6 (monilethrix) 10 

132426 AW118072 Hs.89981 diacylglycerol kinase, zeta (104kD) 10 

132675 AJ291496 Hs.5476 Homo sapiens, clone IMAG&35301 23, mRNA, 10 

132898 W28548 Hs.224829 ESTs 10 

132905 NIVL004235 Hs.7934 KruppeHike factor 4 (gut) 10 

35 133120 NM.003278 Hs.65424 tetranectin (p^asininoger^binding protein 10 

133407 AF017987 Hs.7306 secreted mzzied-related protein 1 10 

133719 H26904 Hs.75736 apolipoprotein D 10 

134007 AF072441 Hs.7840 caldneurin binding protein 1 10 

134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 10 

40 134111 AI372588 Hs.8022 TU3A protein 5 

134117 AA081846 Hs,7921 Homo sapiens mRNA; cONA DKFZp566E183 (fr 5 

134177 BE243319 Hs.79672 WAA0652 gene product 10 

134369 AF207664 Hs.8230 a dlslntegrin-Oke and metalloprotease ( 10 

134496 M64936 gb:Homo sapiens retinoic acioMnducible 10 

45 134510 NMJ02757 Hs.250870 mitogen^ctivated protein kinase kinase 10 

134550 M26315 Hs.85258 C08 anflgen, ajpha polypeptide (p32) 5 

134758 NMJW0078 Hs.89538 cholesteryl ester transfer protein, plas 5 

134963 NlvL003394 Hs.91985 wingless-type MMTV integration site farn) 10 

135066 X04430 Hs.93913 Interleukin 6 (interferon, beta 2) 10 

50 408790 AW580227 Hs.47860 neuixjtrophictyros^e kinase, receptor, type 2 10 

446674 AA563892 Hs.306000 solute carrier famiy 4 (anion exchanger), memo 10 
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TABLE 2 A 

Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



111168 38585 1 AI798376 S46400AW811617AW811616W00557BE142245AW858232 AW861851 AW858362AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 M243093 AA173345 AA199942 AA223384 
20 AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 A1267168 AA157718 

AA157719 AA100472 AA1Q0774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 A1829309 AW991957 N66951 AA527374 H66215 
AA045564 A1694265 H60B08 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 
25 103747 117944.1 AA081995M101099 

134496 46501.1 M84936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T4031 1 

AI684569 AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 
105239 34624 1 AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 M668403 AA601627 
AW869639 BE061833 BE000620 AW961 170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
30 AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 

. C16859 

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 
X51363 X51364 X51365 

102208 6735.9 U22961 AA203623AA503337AI174733 AI1 92802 C06092 AA035357 AJ19061 9 AI199244 AI828450 AA602296 AI378195 

35 AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 

AI685092 A1093426 A1623873 A1074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
N70806 AJ141254 AI383191 AM01237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AJ133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI1 14729 AI061645 A1064716 AI064959 H77388 

40 T85706 AF075298 All 10799 D17107 NMJJ00477 AF190168 R50724 AI24B416 AI207432 AI133684 AI133345 AI174710 

AI133290 AI133304 AI174948 A1207484 AI1 10717 AF074624 AI114515 AF063516 AI110642 AI114559 A1114498 AI114759 
AI207568 AM64960 AI174753 AM 14666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51 189 T74936 AI207490 AI132925 AI064701 
AI174748 AI1 14663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 

45 A1444620 T52290 016931 T40012 T46403 T58926 T69195 A1133061 T50850 AI400677 AI091136 AA334608 T5741 1 Z20979 

N56507T87485AI133622AA343370T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 
AA332728 T51362 A11 14589 R06691 AJ1 10629 AF063503 AI1 40543 AA334661 AA332720 AA343262 T73513 T86549 
AI1 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 772126 R94135 T83023 T39972 739898 AI174786 
AI132926 R09237 A10&4S38 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H9391 1 AI133106 

50 R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567 

H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 A1065049 T84512 T55918 AI207595 T39951 
AAD05016 T60361 T69176T73356T58795 T61233 T39955 760612 AI114676 A1064778 AA035710 W52763 AI114786 
T83564 AA341859 T81684 T55769 AM 14710 T51776 AA343213 A11 14714 T58102 AI1 10809 R28984 AI174854 AA305675 

5 5 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 

H54053 T73211 AI1 145^) T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147AI478102 AI207662 A1192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232AI9 13646 
T83962 AI0651 12 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289 
AAQ46980 A1823482 A1114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 A1991363 T7B943 A1823481 

60 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318 

AI064689 T56624 T69010 T68982 T68302 AB32829 T72908 AI064819 AC05880 T62895 T69430 795111 AA025050 T73330 
W52657 T71984 T691 18 W92684 AI1 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 T57232 AI336158 
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T61821 T69457 T62900 T62912 772917 T46885 AI702448 T57212 T57203 R94581 T7131 1 T61819 T89358 T67708 T70918 
T59165A1187111 T64308T62071 T69427 AJ1 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75485 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 M443453 T73466 H69672 N53869 T68447 D11809 012412 T64300 
T28321 T55864 



102800 14782J20 AA313538 U88895 U88902 
108351 genban)LAA071 193 AA071 193 
101447 entrezJ/121305 M21305 
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TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnjgenelD: Unigene number 

Unigene TUIe: Unigene gene title 

R1 : Ratio of normal breast tissue to tumor 



15 

Pkey ExAccn UnigeneiO UnigeneTitle R1 

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma vtral oncogene h 10.0 

102208 U22961 gb;Human mRNA clone with similarity to L 10.0 

20 102990 AA829286 Hs.332053 serum amyloid A1 10.0 

111168 AI798376 gb:tr34b07jc1 NCLCGAP_0v23 Homo sapiens 10.0 

111803 AA593731 Hs.325823 ESTs, Moderately similar to AW5JHUMAN A 10.0 

130085 M62402 Hs.274313 insulin-like growth factor binding prote 10.0 

130840 BE048821 Hs.20144 small Inducible cytokine subfamily A (Cy 10.0 

25 131543 AW966881 Hs.41639 piogrammed cell death 2 10.0 

133120 NNL003278. Hs.65424 tetranectin (plasmMogen-blndlng protein 10.0 

134758 NMJJ00076 Hs.89538 cholesteryt ester transfer protein, plas 10.0 
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TABLE 3A 

Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
5 designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
inRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



15 Pkey CAT number Accessions 

1 11 168 38585J AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 

AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
_ . AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 

20 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 

AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 A1829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

1022086735J U22961 AA203623 AA503337 A1174733 AI192802 C06092 AA035357 AI190619 AJ199244 AI828450 AA602296 AI378195 
25 AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 A1018388 R68584 AA027196 AI745413 AI685092 

AI093426 AI623873 AJ074570 N50096 AA047486 N25060 AA327614 AKJ42512 AI383957 AA156873 AI333101 N70806 AI141254 
AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 AA877103 W84464 
AA625146 R68379 AI133207 A1132980 AI133214 A1064826 AI061615 AI133473 A1174852 AH33404 AI133272 V00494 M12523 
M12523 AI207526 AI133120 AI064802 AI174993 AI114729 A1061645 A1064716 AI064959 H77388 T85706 AF075298 AI110799 
3 0 017107 NIVL000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 A1174710 AI133290 AJ133304 A1174948 

AI207484 AI1 10717 AF074624 AI114515 AF063516 AH 10642 AH 14559 AI114498 AI114759 AI207568 AI064960 AI174753 
AI114666 R69184RO0O11 AI064997 T60501 AI207701 T71735AA385318 H73569 T60496 H94399 AI1 33158 T74675 AA484750 
T73413 T55909 R50261 T72061 N80533 T51189 T74936 A1207490 AI132925 A1064701 A1174748 A1114663 AI133104 AI132999 
_ _ AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 AI444620 T52290 D16931 T40012 T48403 

35 T58926T69195 AI133061 T50850 AI400677 AI091 136 AA334608 T5741 1 Z20979 N56507T87485A1 133622 AA343370 T40075 

T69671 T53849 T74820 AF075316 AH 10818 T40121 T57381 AH 14468 AA332728 T51362 AI1 14589 R06691 AI110629 
AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 AI114840 T57284 T39981 T61407 T72757 T51749 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 AI132926 R09237 AI064838 AI133660 T60398 T88753 
. T55930 T92126 AI444602 T60996 AH 14792 H93911 AI133106 R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 

40 AI133040 AI133017 AI064857 AH 10730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 AA334224 AA334281 

R06692 T64739 T40163 T60628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 
AI065049 T84512 T55918 AI207595 T39951 AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 
A1064778 AA035710 W52763 AH 14786 T83564 AA341859 T81684 T55769 AI1 14710 T51 776 AA343213 AI114714 T58102 
. AH 10809 R28984 AH74854 AA305675 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 

45 AF075308 W86731 T82851 T48269 H54053 T73211 AH14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 

R02576 T95666 AI203974 AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 
AI1331 17 AI766232 AI913646 T83962 AI0651 12 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701 169 AI936354 
AI1 14720 AM33289 AA046980 AI823482 AH 14536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 
T78943 AI823481 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AH50977 T62890 T71374 T68294 AI174774 
50 , T6741 1 T68318 AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AJ064819 AI205880 T62895 T69430 T9511 1 

AA025050 T73330 W52657 T71984 T691 18 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 
T57232 AI336158 T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819T89358 
T67708 T70918 T59166 AI1871 1 1 T64308 T62071 T69427 AI1 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 
T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 
55 AI312890 T67751 A1174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 

T64300T28321T55864 
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TABLE 4: Figure 4 from BRCA 001 US 
5 Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos pnobeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 
10 UnlgenelD: Unigene number 
Unigene Title; Unigene gene title 

R1: Ratio of tumor to normal breast tissue 

15 Pkey ExAccn UnlgenelD Unigene Title R1 

100113 NM_001269Hs.84746 chromosome condensation 1 2.3 

100114 X02308 Hs.82962 thymidylate synthetase 2.9 
100131 D12485 Hs.11951 ectonucteotkie pyrophosphatase/phosphoojesterase 1 1.9 

20 100146 BE185499 Hs.2471 KIAA0020 gene product 1.9 

100147 013666 Hs.136348 osteoblast specific factor 2 (fasclcDn Mike) (periostin) 7.5 

100154 H60720 Ks.81892 WAAQ101 gene product 9.2 

100163 W44671 Hs.124 gene predicted from cDNA wifo a complete coding sequence 1.6 

100220 AW015534 Hs.217493 annexinA2 2.0 

25 100265 D38521 Hs.1 12396 WAA0077 protein 1.5 

* 100271 BE160081 Hs.256290 SlOOcaldunvbiruflng protein A11 (caigizzarin) 13.5 

100275 BE242802 Hs.154797 K1AA0090 protein 5.1 

100323 D50920 Hs.23106 K1AA0130 gene product 1.9 

100335 AW247529 Hs.6793 platetekictivating factor acetythydrolase, boform lb, gamma subunit (29kD) 2.7 

30 100364 NM.004341HS.1 54868 carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and oThydroorotase 2.0 

100372 NrVL014791Hs.184339 WAA0175 gene product 2.6 

100393 084145 Hs.39913 novel RGD-containing protein 3.2 

100400 AW954324 Hs.75790 phosphaWyfinosito! glycan, class C 1.5 

100418 D86978 Hs,84790 K1AA0225 protein 2.0 

35 100482 M65028 Hs.81361 heterogeneous nuclear rfconucleoprotein A/B 2.9 

100518 NM_004415HsJ4316 desmoplaWn (DPI, DPII) 1.9 

100666 L05424 Hs.169610 C044 antigen (homing function and Indian blood group system} 5.7 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 9.0 

100668 L05424 Hs.169610 COW antigen (homing function and Indian blood group system) 7.6 
40 10067B AW502935 Hs.740 PTK2 protein tyrosme kinase 2 53.2 

100685 AA328229 Hs.184582 ribosomal protein L24 1.8 

100690 AA383256 Hs.1657 estrogen receptor 1 1.6 

100783 AF078847 Hs. 191356 general transcription factor UR polypeptide 2 (44kDsubunit) 5.9 

100850 AA836472 Hs.297939 cathepsinB 1.7 

45 100892 BE245294 Hs.180789 S164 protein 1.7 

100945 AF002225 Hs.1 80686 ublquftin protein IJgase E3A (human papilloma virus E6-assodated protein, Angelman syndrome) 1.5 

100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial carrien adenine nucleotide trans^ 6.3 

100988 AKD00405 Hs.76480 ubtqultin-Gke 4 11.4 

100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 1.6 

50 101031 J05070 Hs.151738 matrix metaUoproteinase 9 (gelatinase B, 92kD gelatinase, 92kO type IV collagenase) 8.2 

101045 J05614 gfoHuman proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0 

101077 N99692 Hs.75227 Errtpiricaily selected from AFFX single probeset 2.6 

101093 L06419 Hs.75093 procoflagen-fysme, 2-oxoglutarate 5-dbxygenase (lysine hydroxylase, Ehiers-Oanios syndrome type VI) 1.4 

101161 NM.006262HS.37044 peripheral 16.9 

55 101186 AA020956 Hs,179881 corottndlng factor, beta subuntt 2.0 

101216 AA284166 Hs.84113 cyc&niependent kinase inhibitor 3 (CDK2-assodated dual specificity phosphatase) 1.8 

101228 AA333387 Hs.82916 chaperonln containing TCP1, subunlt 6A (zeta 1) 1.7 

101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 1.9 

101249 L18964 Hs.1904 protein kmase C. (ota 1.5 

60 101332 J04088 Hs.156346 topoisomerase (DMA) U alpha (170KD) 5.2 

101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kO) 3.4 

101352 AI494299 Hs.16297 COX17 (yeast) rwrnolog, cytochrome c oxidase assembly protein 6.3 

101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2 

101445 M21259 gbfluman Alu repeats in the region 5 to the small nuclear rib 1.9 

65 101470 NM.000546Hs.1846 tumor protein p53 (U-Fraumeni syndrome) 1.6 
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101478 NM_002890Hs.758 RASp21 protein activator (GTPase activating protein) 1 2.5 
101483 M24486 Hs.76768 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 44iydroxytase), alpha polypepfide I 5.5 
101540 J04977 Hs.84981 X-ray repair complemenflng defective repair in Chinese hamster cells 5 (double-strand-break rejotnbg Z1 
101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subunlt, ATPase. 3 1.6 
5 101580 NMJ)12151Hs.83363 coagulation factor Vlll-assodated (intronlc transcript) 5.7 
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 1.8 
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 
101621 BE391804 Hs.62661 guanyiate binding protein 1, interferornlnducble, 67kD 2.4 
101702 AW504089 Hs.179574 protein phosphatase 2 (fonnerly2A), regulatory suburitBffR 52), alpha isoform 13 

10 101734 M74099 Hs.1 47049 cut (OrosophHa)-fiKe 1 (CCAAT displacement protein) 2.1 
101759 M80244 Hs.184601 solute carrier family 7 (cafonfcamino acid trareporter,y+ system), inember 5 5.0 
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 
101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2 
101805 AW409747 Hs.75612 stress4nduced-phosphoprotein 1 (Hsp70/Hsp90-organlzing protein) 8.6 

15 101806 AA586894 Hs.1 12408 S100 catelunvbinding protein A7 (psoriasin 1) 8.9 
101810 NM_000318Hs.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 3.2 
101879 AA176374 Hs.243886 nuclear autoanttgenlc sperm protein (histone-blnding) 1.6 

101911 AA441787 Hs.1 19689 glycoprotein hormones, alpha polypeptide 31.3 
101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLAII 1.8 

20 101973 U41514 Hs.80120 UDP^-acetyl*lpha-D^^ 

101983 AI904232 Hs.75323 prohlbitin 8.4 

102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 

102036 BE250127 Hs.82906 CX20 (cell division cycle 20, S. cerevisiae, homolog) 2.0 

102083 T35901 Hs.75117 Interleukin enhancer binding factor 2, 45kD 1.6 

25 102083 T35901 Hs.75117 interteukin enhancer binding factor 2. 4 1.3 

102107 BE258602 Hs.182366 heat shock protein 75 1.4 

102123 NM_001809Hs.1594 centromere protein A (17kO) 1.8 

102165 BE313280 Hs.159627 death associated protein 3 4.6 

102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit(50kD) 4.3 

30 102217 AA829978 Hs.301613 JTV1 gene * 6.7 

102220 U24389 Hs.65436 lysosomal 4.3 

102234 AW163390 Hs.278554 heterochromaWke protein 1 1.9 

102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, Importin alpha 1) 4.4 

102302 AA306342 Hs.69171 protein kinase OBke 2 2.7 

35 102330 BE298063 Hs.77254 drorrwboxtorrdog 1 (Drosophfla HP1 beta) 1.5 

102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 

102348 U37519 Hs.87539 aWehydeo^ydrexjenase 3 family. memberB2 2.0 

102349 AU077055 Hs.289107 bacufoviral IAP repeakxmtaining 2 3.2 
102369 U39840 Hs.299867 hepatocyte nuctear factor 3, alpha 2.0 

40 102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 

102391 AA296874 Hs.77494 deoxyguanosine kinase 1.5 

102455 U48705 Hs.75562 discoWin domain receptor family, member 1 6.9 

102465 NM_001359Hs.81548 2,4-dienoyl CoA reductase l.ntforiondnaJ 1.8 

102488 U50939 Hs.61828 amyloid beta precursor proteirvbinding protein 1,59kD 1.5 

45 102489 AL080116 Hs.74420 origin recognition complex, subunft 3 (yeast homofog)-&ke 3.3 

102494 A1188137 Hs75193 COP9 homolog 2.1 

102501 AF217197 Hs.74562 siah binding protein 1;FBP Interacting repressor; pyrimidine tract binding splicing 3.2 

102522 BE250944 Hs.183556 solute carrier famfly 1 (neutral amino acid transporter), member 5 2.8 

102532 AF040253 Hs.70186 suppressor of Ty (S.cerevislae) 5 homolog 5.7 

50 102564 U59423 Hs.79067 MAD (mothers against decartfnteplegic, Drosophifa) homolog 1 2.3 

102568 W81489 Hs.223025 RAB31, member RASorcagene family * 5.3 

102580 U60808 Hs.152981 CDP^facytgryceral synthase^ 2.1 

102581 AU077228 Hs.77256 enhancer of zeste (Drosophfla) homolog 2 1.6 
tQ2582 U61232 Hs.32675 tubufin-spectfic chaperons e 2.1 

55 102617 AW161453 Hs.198767 COP9 (constitufive photomorphogenic, Arabidopsis, homolog) subunit 5 1.8 

102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 

102627 AL021918 Hs.158174 zinc finger protein 184 (KruppeHike) 1.3 

102663 NJvL002270Hs.168075 karyopherin (importin) beta 2 1.8 

102676 BE262989 Hs.12045 putative protein 2.3 

60 102687 NMJ07019HS.93002 ubiquitin carrier protein E2-C 4.3 

102689 U96132 Hs.171280 hydroxyacyW^zymeAdehyarogenase.typell 6.0 

102696 BE540274 Hs.239 forkheadboxMI 4.2 

102704 AU077056 Hs.54089 BRCA1 associated RING domain 1 1.9 

102705 T97490 Hs.5O002 small inducible cytokine subfamily A (Cys-Cys), member 19 2.3 
65 102750 AB014460 Hs.66196 nfli (E.coli endonudease lll)-like 1 1.2 

102801 BE252241 Hs.38041 pyridoxal (pyrldoxine, vttarnin B6) kinase 6.4 

102812 U90549 Hs.236774 h^TKmobittty group (nonhistone chrornosomal) protein 17-like 3 1.6 
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102827 BE244588 Hs.6456 chaperanin containing TCP1, subunit 2 (beta) 5.6 

102831 AA262170 Hs.80917 adaptor-related protein complex 3. sigma 1 subunit 2.0 

102844 AV653790 Hs.324275 WW domain-containing protein 1 1.3 

102863 X02419 Hs.77274 plasminogen activator, urokinase 4.4 

5 102925 BE440142 Hs.2943 signal recognition particle 19kO 1.9 

102935 BE5S1850 Hs.80506 smaB nuclear ribonudeoproteln polypeptide A' 2.4 
102968 AU076611 Ks. 154672 methylene tetrahydrofokte dehydrogenase (NAD* dependent), methenyltetrahydrofoJate cyclohydroiase2.7 

102983 BE387202 Hs.1 18638 non-metastatic ceils 1, protein (NM23A) expressed In 3.1 

102985 U95742 HsJ707 G1 to S phase transition 1 52 

10 103023 AW500470 Hs.1 17950 multifunctional polypeptide similar to SAJCAR synthetase and AiR carboxylase 1.6 

103038 AA926960 Hs.334883 COC28 protein kinase 1 2.5 

103060 NNL005940HS.155324 matrix metailoproteinase 11 (MMP11; stromerysln 3) 4.5 

103080 AU077231 Hs.82932 cycfinOI (PRAD1: parathyroid adenomatosbl) 3.1 

103089 D31 152 Hs.179729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2.4 

15 103177 BE244377 Hs.48876 famesyWiphosphate famesyttransferase 1 * 3.5 

103178 AA205475 Hs.275865 ribosomal protein S18 9.9 

103179 NM_001777Hs.82685 C D47 antigen (RJwelated antigen, Integnn^sscdated signal transducer) 1.3 
103181 X69636 Hs.334731 Homo sapiens, done IMAGE:3448306. mRNA, partial cds 2.0 
103185 NMJ)06825Hs.74368 transmembrane protein (63kD), endoplasn^ 1.6 

20 103191 AA401039 Hs.2903 protein phosphatase 4 {formejly X), catalytic 2.5 

103193 NM_004766Hs.75724 coatomer protein complex, subunit beta 2 (beta prime) 22 

103194 NM_004939Hs.78580 DEAO/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 6.3 
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 
103223 BE275607 Hs.1708 chaperanin containing TCP1, subunit 3 (gamma) 3.0 

25 103232 X75962 Hs.1 29780 tumor necrosis factor receptor superfamBy, member 4 1.8 

103238 A1369285 Hs.75189 death-associated protein 5.6 

103297 NMJ)01545Hs.9078 Immature colon carcinoma transcript 1 1.9 

103330 A1803447 Hs.77496 smafl nuclear ribonudeoproteln polypeptide G 2.5 

103349 X89059 gb^apiens mRNA for unknown protein expressed in macrophage 1.6 

30 103376 AL036166 Hs.323378 coated vesicte membrane protein 1.8 

103391 X94453 Hs.1 14366 pyrralme^-carboxyiate synthetase (glutamate gamma-semialdehydo synthetase) 2.3 

103392 X94563 gb:RsapIens dbi/acbp gene exon 1 & 2. 4.0 
103430 BE564090 Hs.20716 translocase of inner mitochondrial membrane 17 (yeast) homotog A 1.3 
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6 

35 103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer-birKling protein 2 beta) 5.1 

103547 A1376722 Hs.180062 pratsasome (prosome, macropaln) subunit, beta type, 8 (large multifunctional protease 7) 9.7 

103588 NM_006218Hs.85701 pr^rjrwinositide-3-lanase, catalytic, alpha polypeptide 2.0 

103613 NM_000346Hs.2316 SRY (sex determining region Y)*box 9 (campomelic dysplasia, autosomal sex-reversai) 1.3 

103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypepBde K (7.0kD) 2.0 

40 103622 AA609685 Hs.278672 membrane component crtromosome 11, surface marker 1 2.3 

103727 AI876883 Hs.296361 growth factor receptor-cound protein 2 1.3 

103749 AL135301 Hs.8768 hypothetical protein FU10849 1.8 

103754 A1015709 Hs.172089 Homo sapiens mRNA; cONA DKFZp586l2022 (from done DKFZp586l2022) 1.3 

103780 AA094752 Hs.169992 hypothefcal 432 Kd protein 75 

45 103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds U 

103797 AA080912 gbzn04d03 f\ Stratagene hNT neuron (937233) Homo sapiens cONA clone 5 similar 1.5 

103813 AI042582 Hs.1 81 271 CGM 20 protein 15 

103855 W02363 Hs.302267 hypotheOcal protein FU10330 1.5 

103886 AK001278 Hs.105737 hypothefical proiatn FU 1041 6 similar to coristrMve photon^orphogenlc protein 1 6.5 

50 104052 NMJ02407HS.97644 inamrnaglobln 2 2.9 

104079 AA251242 Hs.103238 ESTs 1.4 

104174 AA478984 Hs.6451 PRO0659 protein 5.6 

104227 A8002343 Hs.98938 protocadherin alpha 9 1.6 

104275 AI751970 Hs.101067 GCN5 (general control of amino-acW synthesis, yeast homolog)-tike 2 5.4 

55 104325 BE379766 Hs.150675 polymerase (RNA) U (DNA directed) polypeptide K (7.0kD) 6.3 

104370 AA324597 Hs.21851 Homo sapiens cONA FU 12900 fs, clone NT2RP2004321 1.6 

104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5.2 

104482 AB037762 Hs.44268 myelin gene expression factor 2 1.2 

104532 AI498763 Hs.203013 hypothetical protein FU 12748 2.1 

60 104563 AL1 17403 Hs.306189 DKFZP434F 1735 protein 1.2 

104667 AI239923 Hs.30098 ESTs 1.3 

104757 AI694413 Hs.332649 olfactory receptor, family 2. subfamily I, member 6 2.3 

104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H^apiens] 1.3 

104806 AB023175 Hs.22982 WAA0958 protein 2.3 

65 104827 AW052G06 Hs.8551 PRP4/STKWD spOcing factor 10.9 

104846 AI250789 Hs.32478 ESTs 5,6 

104854 AA041276 Hs.1 54729 3-phosphoinositide dependent protein Wnase-1 12-3 



116 



WO 02/059377 



PCT/US02/02242 



104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially RAP-2A 2.0 

104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp56402364 (from done DKFZp56402364) 1.3 

104896 AW015318 Hs.23165 ESTs 177 

104909 AW408164 Hs.249184 transcription factor 19 (SC1) 50 

5 104916 AW958157 Hs.155489 NSI-associated protein 1 17 

104919 AA026880 Hs.25252 prolactin receptor 14 

104930 AF043467 Hs.32893 neurexophilln 2 22 

104973 NM_015310Hs.6763 K1M0942 protein 5 0 

104974 Y12059 Hs.278675 bromodomain-contalning 4 14 
10 104975 AL136877 Hs.50758 SMC4(stnjcturalniainteranceof(*raniosomes 4, yeasty 1 2.4 

104975 AL136877 Hs.50758 SMC4 (structural maintenance of chrarnoso 2.3 

104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKENcDNA 20 10317E24 gene, dor«lJ^GE:3502019,rnRNA ( partia!cds7^ 

104979 AA937934 Hs.321062 ESTs * 1.3 
104994 A1499930 Hs.334885 mitochondrial GTP binding protein 3.5 

15 105009 BE379584 Hs.34789 doiichyMiphosphooBgosa^^ 5.5 

105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3 

105028 A1050715 Hs.2331 E2F transcription factor 5, p130-blndlng 2.2 
105032 M127818 gb.7l12a02^1 Soares_pregnantuterus.NbHPU Homo sapiens cDNA clone IMAGE:501674 3' 6.8 

105039 AA907305 Hs.36475 ESTs 25 

20 105041 AB037716 Hs.26204 KIAA1295 protein 2.2 

105045 BE242899 Hs.129951 speckle-type POZ protein 38 

105079 AA151342 Hs.12677 CG1-147 protein 95 

105087 AA147884 Hs.9812 Homo sapiens cDNA FU14388 fis, done HEMBA1002716 5.6 

105088 H58589 Hs.35156 Homo sapiens cDNA FU1 1027 fis, clone PLACE10041 14 2.2 
25 105095 Z78407 Hs.27023 veslde transport-related protein 22 

105110 BE387350 Hs.33122 WAA1 160 protein 16 

105126 AW975433 Hs.36288 ESTs 6.3 

105127 AA045648 Hs.301957 nudix(nudeoside diphosphate linked moiety X)-type motif 5 2.1 
105141 AA164687 Hs.177576 mannosyf(a!pha-1,3-)^I^pro 2.7 

30 105158 AW976357 Hs.234545 hypothetical protein NUF2R 19 

105169 BE245294 Hs.180789 S164 proteki 17 

105186 AA191512 Hs.28005 Homo sapiens cONA FU1 1309 fis, clone PLACE 1010076 4.8 

105254 AA071276 Hs.19469 KIAA0859 protein 1.9 

105281 AA263143 Hs.24596 RADSI-interacfing protein 2.8 

35 105288 N99673 Hs.3585 ESTs, Weakly similar to AF1 26743 1 DNAJ domain-containing protein MCJ [H^apiens] 1.9 

105302 AA700122 Hs.3355 sentrin-spedfic protease 80 

105331 AW270037 Hs.179507 WAA0779 protein 1.8 

105359 NMJ)16015Hs.8054 CGI-68 protein 8.2 

105366 BE264645 Hs.282093 hypothetical protein FU21 918 5.0 

40 105373 AW887701 Hs.32356 hypothetical protein FU20628 2.5 

105374 BE242803 Hs.262823 hypothetical protein FU 10326 2.2 

105387 AW592146 Hs.108636 membrane protein CH1 2.3 

105393 AF167570 Hs.256583 interleukin enhancer binding factor 3. 90kO 5.4 

105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 16 

45 105400 AF198620 Hs,65648 RNA binding motif protein 8A 16 

105445 AA252395 gb:zs12g10.s1 NCLCGAPJ3CB1 Homo sapiens cONA clone IMAGE:685026 3, mRNA sequence. 5.0 

105507 BE268348 Hs.226318 CCR4-NOT transcription complex, subunit 7 16 

105529 AA113449 Hs.32471 hypothetical protein FU20364 13 

105530 AB023179 Hs.9059 K1AA0962 protein 3.4 
50 105547 AA262640 Hs.27445 unknown 9.3 

105564 BE616694 Hs.288042 hypothetical protein FU14299 14 

105596 AA579535 Hs.18490 hypothetical protein FU20452 10.9 

105597 AF054284 Hs.334826 splicing factor 3b, subunit 1, 155kD Z9 
105608 A1808201 Hs.287863 hypothetical protein FU 12475 17 

55 105610 AA280072 Hs.99872 fetal Alzheimer antigen 14 

105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 17 

105620 AW302245 Hs.181390 casein kinase 1, gamma 2 * 5.5 

105658 AA985190 Hs.246875 hypothetical protein FU20059 9.4 

105697 AW499988 Hs.27801 zinc finger protein 278 2.0 

60 105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from clone DKFZp564M0264) 1.7 

105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (lg), transrr^nibrane d()main fTM) and short 2.6 

105746 AW151952 Hs.46679 hypothetka! protein FU2G739 1.5 

105759 AI123118 Hs.15159 chemoklne-Oke factor, alternatively spliced u 

105771 AI267720 Hs.153221 synovial sarcoma, translocated to X chromosome 16 

65 105820 AA741336 Hs.152108 transcriptional unit N143 22 

105826 AA478756 Hs.194477 E3 ubiquttin tigase SMURF2 13 

105856 AI262106 Hs.12653 ESTs 2.4 
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105858 
105875 
105930 
106000 
106011 
106017 
106073 
106078 
106034 
106140 
106271 
106288 
106300 
106333 
106350 
106359 
106381 



106457 
106470 
106531 
106566 
106589 
106610 
106624 
106650 



105713 
106717 
105723 
106795 
106829 
106831 
106846 
106852 
106873 
106686 
106906 



106945 
106973 
106977 
106978 
107004 
107029 
107071 
107113 
107125 
107136 
107136 
107146 
107151 
107155 
107174 
107197 
107221 
107243 
107248 
107263 
107265 
107298 
107298 
107299 
107316 
107354 
107392 
107481 



AF151066 Hs.281428 
AK001708 Hs.32271 
AF016371 Hs.9880 
AW194426 Hs.20726 
AW081202 Hs.12284 
AA477956 Hs.26268 
AL157441 Hs.17834 
AA130158 Hs.19977 
AA533491 Hs.23317 
AB006624 Hs.14912 
AA251393 Hs.289052 
AB037742 Hs.24336 
Y10043 Hs.19114 
AL043114 Hs.22410 
AKD01404 Hs.194698 
AW390282 Hs.3113Q 
AB040916 Hs.24106 
AW748420 Hs.6236 
AF1 19256 Hs.27801 
D63078 Hs.186180 
AA454036 Hs.8832 
AA243837 Hs.57787 
AKD00933 Hs.28661 
AA458882 Hs.79732 
NfvL003595Hs^6350 
AL049951 Hs.22370 
AV657117 Hs.184164 
BE614802 Hs.184352 
AA600357 Hs.239489 
BE388094 Hs.21857 
AF174487 Hs£93753 
AW959893 Hs.27099 
BE564871 Hs29463 
AB037744 Hs.34892 
AF1 51031 Hs.300631 
N498Q9 Hs.11197 
W79171 Hs.9567 
AA861271 Hs.222024 
AK001838 Hs.296323 
AK000511 Hs.6294 
BE156256 Hs.11923 
AL043152 Hs.50421 
AW631480 Hs.8688 
AA146872 Hs.300700 
AF264750 Hs.288971 
AW385224 Hs.35198 
AK000733 Hs.23900 
AK000512 Hs.69388 
AV661958 Hs.8207 
AV661958 Hs.8207 
AK001455 Hs.5198 
AW378065 Hs.8687 
AW391927 Hs.7946 
BE122762 Hs.25338 
W15477 Hs.64639 
AW888411 Hs.81915 
BE219716 Hs.34727 
AW263124 Hs.315111 
D60341 Hs.21198 
BE379594 Hs.49136 
N95657 Hs.6820 
N95657 Hs.6820 
BE277457 Hs.30661 
T63174 Hs.193700 
NMJW6299Hs£6448 
AW299900 Hs.267632 
AA307703 Hs.279766 



hypothetical protein 

hypothetical protein FU10846 

peptidyl prolyl isomerase H (cydophi&n H) 

ESTs 

Homo sapiens, done !MAGE:2989556. mRNA, panla) cds 
ESTs 

downstream neighbor of SON 



2.9 
1.4 
5.2 
1.7 
2.8 
1.4 
1.4 



ESTs. Moderately similar to ALU8JWMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6 

hypothetical protein FU14681 6.6 

WAAQ286 protein 1.6 
Homo sapiens, Similar to RIKEN cONA 5430429M05 gene, done MGC:13155, mRNA, complete cds 10.8 

K1AA1321 protein 1.3 

Ngrwnobffity group (northist^chronwscona!) protein 4 3.6 

ESTs, Weakly similar to A54849 collagen alpha 1(V1I) chain precursor [H.sapiensI 5.4 

cycfin B2 5.7 

transmembrane 7 super£arnily member 2 6.3 

WAA1483 protein 6.5 

Homo sapiens cDNA: FU21487 fis, clone COL05419 22 

zinc finger protein 278 2.7 

Homo sapiens cONA: aJ23038 fis, done LNG02039 2,3 

ESTs 16 

ESTs 1.6 

Homo sapiens cONA RJ 10071 fis, clone HEMBA1001702 2.4 

Bbutin 1 7.9 

tyrosylproteln sulfotransferase 2 7.7 

Homo sapiens mRNA; cDNA OKFZp564O0122 (from done DKFZp564O0122) 1.8 

ESTs, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 1 .3 

hypothetical protein FU12549 4.5 

T1A1 cytotoxic granule-associated RNA-bindbg protein 1 .3 

ESTs 1.6 

Bd-2-related ovarian killer protein-like 5.7 

hypothetical protein RJ23293 similar to ARL-6 interacting protein-2 16.2 

centrin, EF-hand protein, 3 (COC31 yeast homotog) 1 .5 

WAA1323 protein 2.2 

hypothefica! protein 13 

Homo sapiens, done IMAGE:3343149, mRNA, partial cds 16.8 

GL002 protein 1.5 

transcription factor BMAL2 22 

serurn/gtucocorticoid regulated kinase 3.3 

hypothetical protein DKFZp434L1435 similar to valyi tRNA synthetase 6.8 

hypothetical protein 6.6 

WAA0203 gene product 4.8 

ESTs 6.0 

hypothetical protein FU20727 1 .3 

m^iaVryn^oidormaeo^ 1.8 

GTPase activating protein 2.5 

hypothetical protein FU20505 1.7 

GK001 protein 4.6 

GKD01 protein 3.3 

Own syndrome critical region gene 2 2.0 

ESTs 6.3 

WAA1288 protein 33.5 

ESTs 5.2 

glioma pathogenesis-related protein 6.1 

leukemia-associated phosphoprotein p 1 8 (stathmln) 1 7.4 

ESTs, Moderately similar to 138759 zinc finger/leucine zipper protein (H.sapiens] 7.4 

nudear receptor co-repressor/HDAC3 complex subunit 1 .8 

transtocase of outer mitochondria] membrane 70 (yeast) homotog A 6.6 
ESTs, Moderately similar to ALU7 JWMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.5 

ESTs, Moderately similar to YOJ1.CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5 

ESTs, Moderately similar to YOJ1_CAEEL H 1.7 

hypothetical protein M6C4606 3.2 

Homo sapiens mRNA; cDNA DKFZp586l0324 (torn clone DKFZp586l0324) 2.0 

zinc finger protein 193 5.0 

TATA element modulatory factor 1 1 2 

klnesln family member 4A 1.6 
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107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 30 

107554 AA001365 Hs.59844 ESTs 1 3 
107681 BE379594 Hs.49136 ESTs. Moderately similar to ALU7JWMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2^2 

107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

3 107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-reciifier, subfamily S, member 3 8.4 

107901 L42612 Hs.335952 keratin 6B 25 

107901 L42612 Hs.335952 keratin 6B w 

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 2.2 

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 1 6.7 
10 108040 AL121031 Hs. 159971 SWi/SNF related, matrix associated, actln dependent regulator of chromatin, subfamily b t member 1 1.5 

108230 AA054224 Hs.59847 ESTs 13 

108274 AF129535 Hs.272027 F-box only protein 5 71 

108296 N31256 Hs.161623 ESTs 25 

108496 AA083089 Hs.339659 ESTs 3,5 

15 108607 BE300380 Hs.69476 Homo sapiens cDNAFU12758fis l done NT2RP2001 328 3.4 

108621 AA101809 Hs.182685 ESTs 16 

108634 AW022410 Hs.69507 ESTs 17 

108647 BE546947 Hs.44276 homeoboxCIO 98 

108695 AB029000 Hs.70823 K1M1077 protein 12 

20 108717 AA122393 Hs.70811 hypothetical protein FU20516 13 

108740 AJ089575 Hs.9071 progesterone membrane binding protein 21 

108828 AK001693 Hs.273344 DKFZP564O0463 protein 1.8 

108859 AL121500 Hs.178904 ESTs 1.5 

108872 H06720 Hs.111680 endosulfine alpha Z1 

25 108891 AI801235 Hs,48480 ESTs 5,3 

108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4.0 

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA. complete cds 5.6 

108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1 1,6 

108987 AA152178 Hs.23467 hypothetical protein FU10633 6.2 

30 109002 AB028987 Hs.72134 KIAA1064 protein 1.7 

109011 AA156542 Hs.72127 ESTs 1.4 
109026 AA157811 gb:zo35d07.s1 Stratagene colon (937204) Homo sapiens cDNA done 3' similar to contains Alurepetifive 5.3 

109068 AA164293 Hs.72545 ESTs Z9 

109101 AW608930 Hs.52184 hypothetical protein FU2061 8 1.6 

35 109112 AW419196 Hs.257924 hypothetical protein FU13782 3.2 

109124 AKOO0684 Hs.183887 hypothetical protein FU22104 1.7 

109139 AJ132592 Hs.59757 zinc finger protein 281 2.6 

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkinesin 6) 2.9 

109198 BE566742 Hs.58169 highly expressed In cancer, rich In leudne heptad repeats 2.0 

40 109213 NMJ16603HS.82035 potential nudear protein C50RF5; GAP-like protein 5.3 

109220 AW958181 Hs.189998 ESTs 5.7 

109233 AU077281 Hs.170285 nudeoporin 214kD (CAIN) 5.3 

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ (H.saplens] 1.4 

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (from done DKFZp586F1822) 2.9 

45 109313 AF153201 Hs.86276 C2H2 (KruppeMype) zinc finger protein 1.3 

109341 AA213506 Hs.1 15099 EST 2.9 

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nudear target (MINT) homotag 1.5 

109420 H83603 Hs.40408 homeo box C9 2.2 

109426 N30531 Hs.42215 protein phosphatase 1 t regulatory sub-unit 6 3.0 

50 109429 AI160029 Hs.61438 ESTs * 1.9 

109445 AA232103 Hs.189915 ESTs 1.8 

109450 AB032969 Hs.173042 KIAA1143 protein 3.7 

109468 NM_015310Hs.6763 KIAA0942 protein 32 

109478 AW074143 Hs.87134 ESTs 2.0 

55 109570 L40027 Hs.1 18890 glycogen synthase kinase 3 alpha 2.1 

109662 F02614 Hs.27319 ESTs 1.4 

109825 R71264 Hs.16798 ESTs 1.3 

110039 H11938 Hs.21907 histone acetyltransferase 2.0 

110056 AA503041 Hs.279009 matrix Gla protein 2.5 

60 110085 AA603840 Hs.29956 KWA0460 protein 17 

110110 T07353 HsJ948 ESTs 2.9 

110129 R51853 Hs.226429 ESTs, Weakly similar to ALU1 HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 1.7 

110154 NMJH4521Hs,17667 SH3«Jomain binding protein 4 4.2 
110240 AI668594 Hs.176588 ESTs, Weakfy similar to CP4Y_HUMAN CYTOCHROME P450 4A1 1 PRECURSOR [Hsapiens] 4.2 

65 110242 N41744 Hs.19978 CGI-30 protein 13 

110259 H28428 Hs.32406 ESTs, Weakly simBarto I38022 hypothetical protein pteaplensl 22 

110312 BE256986 Hs.11696 hypofoeflcal protein FLJ12089 21 
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110330 A1286666 Hs.16621 OKFZP4341 1 1 6 protein 6.2 

110501 K55748 gb.7q94a01.s1 Soares fetal liver spleen INFLSHomo sapiens cONAdon8lMAGE2034003' 6.1 

110504 H55915 Hs^10859 hypothetic protein FU11016 6.1 

110525 H57330 Hs.37430 EST 6.3 

5 110568 AK00116O Hs.5999 hypothetical protein FU 10298 1.3 

110699 T97586 Hs. 18090 ESTs 1.8 

110705 AB007902 Hs.32168 K1AA0442 protein 1.6 

110742 AW190338 Hs.28029 hypothetical protein MGC1 1256 7.6 

110761 AL138077 Hs.16157 hypothetical protein FU12707 2.5 

10 110762 BE044245 Hs.30011 hypothetical protein MGC2963 9.3 

110765 AK000322 Hs.18457 hypothetical protein FU2031 5 5.5 

110769 BE000831 Hs.23837 Homo sapiens cDNAFUl 1812 us, done HEMBA10O6364 2.1 

110799 A1089660 Hs.323401 dpy^OMike protein 15 

110805 T25829 Hs.24048 FK506 binding protein precursor 6.6 

15 110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1JWMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7 

110820 R33261 Hs.6614 ESTs, WeaHy similar to M3932nw^ 3.4 

110840 N31598 Hs.12727 hypothetical protein FU21 610 1.7 

110844 AI740792 Hs.167531 methylootoroyl-Coenzyme A carboxylase 2 (beta) 1.7 

110854 BE612992 Hs.27931 hypothetical protein FU10607, similar to glucosamine-phosphate N-acetyltransferase - 4.7 

20 110856 AA992380 gb:ot37g06.s1 Soares Jestis_NHT Homo sapiens cDNA clone 3' similar to contains element 2.3 

110885 BE384447 Hs.16034 hypothetical protein MGC1 31 86 3.5 

110897 AL117430 Hs.6880 OKFZP43401 56 protein 2.2 

110915 BE092285 Hs.29724 hypothetical protein FU1 3187 2.6 ' 

110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expression in cancer [Rsapiens] 1.9 

25 110958 NNL005864HS.24587 signal transduction protein (SH3 containing) 6.7 

110963 AK002180 Hs.11449 DKFZP56401 23 protein 2.0 

110981 AK00198O Hs.24284 ADP-ribosyltransferase (NAD+; poly(ADP-ribose) poIymeraseHike 2 1.3 
110984 AW613287 Hs.80120 UDP^^tyWpha-D^alactearrte 1.8 

111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 3.6 

30 111132 AB037807 Hs.83293 hypottietical protein Z1 

111164 N46180 Hs.122489 Homo sapiens cONA FU13289 frs, clone OVARC1001170 23 

111172 R67419 Hs.21851 Homo sapiens cONA FU12900 fls, clone NT2RP2004321 3.7 

111174 AL050166 Hs.26295 Homo sapiens rnRNA; cONA DKFZp586D1 122 (from done DKFZp586D1 122) 7.5 

111179 AK000136 Hs.10760 asporin {LRR class 1) 7.1 

35 111184 AI815486 Hs.243901 Homo sapiens cDNA RJ20738 frs, clone HEP08257 6.7 

111184 A1815486 Hs.243901 Homo sapiens cONA FU20738 (is, clone HE 3.3 

111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse transcriptase homotog (Rsapiensl 3.6 

111216 AW139408 Hs.152940 ESTs 1.5 

111221 AB037782 Hs.15119 KIAA1361 protein 26 

40 111223 AA852773 Hs.334838 KIAA1 866 protein . 4.6 

111239 N90956 Hs.17230 hypothetical protein FU22087 7.9 

111285 AA778711 Hs.4310 eukaryotic translation initiation factorlA 6.9 

111299 A8033091 Hs.74313 KIAA1 265 protein 5.0 

111312 A1523913 Hs.34504 ESTs 3.8 

45 111318 T99755 Hs.334728 ESTs 1.2 

1 11337 AA837396 Hs.263925 USI^teracting protein NUDE1, rat homoiog 5.1 . 

111352 H58589 Hs.35156 Homo sapiens cONAFUl 1027 fls, done PLACE10041 14 2.2 

111370 A1478658 Hs.94631 brefetdin A-inhtbited guanine nucleotide-exchange protein 1 2.8 

111384 N94608 Hs.288969 HSCARG protein 2.2 

50 111389 AK0O0987 Hs.169111 oxldafion resistance 1 21 

1 1 1391 NM.003896HS.225939 siatyttransferase 9 (CMP^euAcilactosyteramide alpha-2.3-siar/ltransferase; GM3 synthase) 5.1 

111392 W46342 Hs.325081 Homo sapiens, done IMAGE3659680, rnRNA, partial cds 8.4 
111452 R02354 Ks. 15999 ESTs 27 
111486 A1051194 Hs.227978 EST 6.5 

55 111549 W90638 Hs.20321 ESTs. Moderately similar to ZRF1 HUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1.4 

111585 R10720 Hs.20670 EST 1.6 

111627 R52656 Hs.21691 ESTs 1.6 

111870 AB037834 Hs.18685 Homo sapiens rnRNA for K1AA141 3 protein, partial cds 2.4 

111937 BE298665 Hs.14846 Homo sapiens rnRNA; cONA DKFZp564O016 (from done DKFZp564D016) 10.6 

60 111944 AW083791 Hs.21263 suppressor of potassium transport defect 3 6.6 

111987 NM.O15310Hs.6763 WAA0942 protein 5.1 

112134 R41823 Hs.7413 ESTs; caisyntBnln-2 2.8 

112244 AB029000 Hs.70823 K1AA1077 protein 14.6 

112388 R46071 Hs.301693 Homo sapiens, done IMAGE:3638994, rnRNA, partial cds 9.0 

65 112456 NMJ)16248Hs.232076 A Hiiase{PRKA) anchor protein 11 1.4 

1 12464 AW007287 Hs.28538 Homo sapiens cONA: FU21086 fis, done CAS03272 1.4 

112506 A1742756 Hs.26079 ESTs 32 
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112513 R68425 Hs.13809 
112752 AK001635 Hs.14838 
112884 AK000004 Hs.5013 
112923 T10258 Hs.5037 
112938 AW97082B Hs.6185 
112958 R61388 Hs.6724 



hypoftietica) protein FU10648 
hypothetical protein FU10773 
Homo sapiens mRNA for FU00004 protein, partial cds 
EST 

K1AA1557 protein 
ESTs 

112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding factor 1 
112978 AK000272 Hs.7099 hypothetical protein RJ20265 

ESTs. Moderately similar to 2115357ATYW protein (Mmusculus] 
zinc finger protein 259 
ESTs 

Homo sapiens mRNA for KIAA1729 protein, partial cds 
Hs.270862 ESTs 

gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cONA done IMAGE.74668 3\ 



112995 AA737033 Hs.7155 

112996 BE276112 Hs.7165 
113047 AI571940 Hs.7549 
113049 AW965190 Hs.7560 
113089 T40707 
113196 T57317 



1 13248 T63857 gb:yc16e01.s1 Stratagene lung (937210) Homo'sapiens cONA clone 3*. mRNA sequence 

113254 AK002180 Hs.11449 DKFZP5640123 protein 

113277 AW971049 Hs.11774 protein (peptidyl-proryl ds/trans isomerase) NiMA-interacting, 4 (parvulin) 
113429 AA688021 Ha 179808 ESTs 
113499 AI467908 Hs.8882 ESTs 
113547 H59588 Hs.15233 ESTs 
113554 AW503990 Hs.142442 HP1-8P74 

113647 AA813887 Hs.188173 Homo sapiens cONA RJ12187 its, clone MAMMA1000831 

113702 T97307 gb:ye53h05.s1 Soares fetal liverspieen 1NFIS Homo sapiens cDNA done IMAGE:121497 3' 

113722 AV653556 Hs.184411 albumin 



2.0 
1.8 
6.6 
1.5 
3.2 
6.0 
6.4 
1.2 
5.6 
2.0 
1.9 
2.4 
1.3 
1.7 
2.8 
1.3 
3.2 
\2 
5.9 
2.0 
3.6 
U 
4.4 
1.3 



25 113759 AW499665 Hs.9456 SWi/SNF related, matrix associated, actin dependent regulator <jfchrornatin,subfamBy a, member 5 1.2 



113777 BE266947 Hs.10590 zinc finger protein 313 
113783 AL359588 Hs.7041 hypothetical protein DKF2p762B226 
113791 AI269096 Hs.135576 chitobiase, dW-acetyl- 
113808 W44735 Hs.9286 Homo sapiens cDNA: FU21278 fis, clone COL01 832 
113811 BE207480 Hs,6994 Homo sapiens cDNA: FU22044 fis, clone HEP09141 
113817 H13325 Hs.332795 hypoftetical protein DKFZp761017121 
113826 AW378212 Hs^4809 hypothetical protein FU10826 

EGF-containing fibuRn-Hke extracellular matrix protein 2 
proteasome (prosome, macropain) 26S subunit, non-ATPase, 11 
hypothetical protein, similar to (U06944) PRAJA1 
ESTs 

hypothetical protein FU22041 similar to FK506 binding proteins 
Hs.268828 ESTs 

Hs.120969 Homo sapiens cDNA RJ1 1562 fis, done HEMBA1003197 



113834 T26483 Hs.6059 

113868 W57902 Hs.90744 

113870 AL079314 Hs.16537 

113885 AW959486 Hs.21732 

113923 AW953484 Hs.3849 

113989 W87544 

114022 AI539519 



134 
1.7 

I. 3 
3.3 
3.1 
32 
2.3 

II. 3 
2.7 
6.1 
6.6 
1.9 
U 
5.4 
9.4 
1.8 
1.5 
1.8 



114030 A1825386 Hs.164478 hypofoetfcal protein FU21 939 similar to 5*zacytidine induced gene 2 
114060 AB029551 Hs.7910 RING1 and YY1 binding protein 
114196 AF017445 Hs.150926 fucose-1 -phosphate guanyr/ltransferase 
114226 AS028968 Hs.7989 KIAA1045 protein 

114253 BE149866 Hs.14831 Homo saplera, Simibr to zinc finger pro^ 2.3 

114262 AL117518 Hs.3686 KIAA0978 protein 14 

114275 AW515443 Hs.306117 WAA0306 protein 15.8 

114292 AI815395 Hs.184641 fatty add desaturase 2 19 

114309 AA332453 Hs.20824 CGI-85 protein 2.4 

114392 AA249590 Hs.100748 ESTs, Weakly similar to A28996 praline-rich protein M14 precursor - mouse |M.musculusJ 1.8 

114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 {from done DKF2p434B0425) U 

1 14455 H37908 Hs.271616 ESTs, Weakly similar to ALUSJiUMAN ALU SUBFAMILY SX SEQUENCE 5.5 

114463 AL120247 Hs.40109 KIAA0872 protein 52 

114464 A1091713 Hs.106597 Homo sapiens, Similar to RIKENcDNA 11 10012M11 gene, done IMAGE:3688605, mRNA, partial cds 1.2 
114471 AA028074 Hs.104613 RP42homolog 18 
114480 BE066778 Hs.151678 UDP4^tyl-aipha^ 13.4 
114671 AA766268 Hs.266273 hypothetical protein FU1 3346 19 
114698 AA476966 Hs.1 10857 polymerase (RNA) III (DNA directed) polypeptide K (123 kDa) 3^5 
114730 AI373544 Hs.331328 intermediate filament protein syncoittn 38 
114767 AI859865 Hs.1 54443 rnlnichrc^omerram^ 16 
114774 AV656017 Hs.184325 CGI-76 protein 31 
1 14798 AA159181 Hs.54900 serologically defined colon cancer antigen 1 3^5 

bromodomain and PHD finger containing, 3 43 

KIAA0887 protein 7,1 

hypothetical protein t3 
gb:zt29K)2.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:723771 3\ mRNA sequence, 1 .5 

2.9 
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114860 AL157545 Hs.42179 

114895 AA236177 Hs.76591 

114896 BE539101 Hs.5324 
114911 AA236672 
114930 AA237022 Hs.188717 ESTs 
114938 AA242834 Hs.58384 ESTs 
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114965 A1733881 Hs.72472 BMP-R1B 2.3 

115023 AF102546 Hs.63931 dachshund (DrosophQa) homoiog 1.3 

115038 AA252360 Hs.87968 toMxe receptor 9 1.6 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA done EUROIMAGE 1913076 113 

S 115062 AA253314 Hs.154103 UM protein (similar to rat protein Wna 1.5 

115117 AI670847 Hs.5324 hypothetical protein 1.5 

115121 A1634549 Hs.88155 ESTs 2.8 

115206 AW1 83695 Hs.186572 ESTs 2.5 

115221 AW365434 Hs.79741 hypothetical protein FU 101 16 1.5 

10 115239 BE251328 Hs.73291 hypoihetkai protein FU10881 1.3 

115242 A1368236 Hs.283732 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 14 

115278 AK002163 Hs.301724 hypothetical protein FU1 1301 1.5 

115285 AW972872 Hs.293736 ESTs 2.4 

115291 BE545072 Hs.122579 hypothetical protein FU 10461 6.2 

15 115400 A1215069 Hs.89113 ESTs 6.6 

115468 AA314349 Hs.48499 tumor antigen SLP^p 7.4 

115471 AK001376 Hs.59346 hypothetical protein FU10514 1.4 

115479 AW301608 Hs.278188 ESTs, Moderately similar to 154374 gene NF2 protein [H^apiens] 4.0 

115496 AW247593 Hs.71819 eukaryotic translation Vitiation factor 4E binding protein 1 16.3 

20 115500 Y14443 Hs.88219 zinc finger protein 200 5.0 

115553 AJ275986 Hs.71414 transcription factor(SMF gene) 2.5 

115581 AI540842 Hs.61082 ESTs 6.1 

115587 BE081342 Hs.283037 HSPC039 protein 2.9 

115590 AA399477 Hs.67896 7-60 protein 5.3 

25 115646 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), memberlO 47 

115652 BE093589 Hs.38178 hypothetical protein FU23468 10.6 

115655 AL048269 Hs.288544 Homo sapiens, clone M6C: 16063, mRNA, complete cds 127 

115663 AI138785 Hs.40507 ESTs 2.0 

115676 AA953006 Hs.88143 ESTs 3.0 

30 115690 AA625132 Hs.44159 hypothetical protein FU2161 5 1.7 

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass 6-type receptor 3, flamingo (Drosophila) homolog 6.8 

115715 BE395161 Hs.1390 proteasome (prosome, macropaln) subunft, beta type, 2 1.7 

115734 AI950339 Hs.40782 ESTs 2.6 

115811 NM_015434Hs.48604 DKFZP434B168 protein 2.1 

35 115823 AI732742 Hs.87440 ESTs 2.1 

115837 AI675217 Hs.42761 ESTs 1.3 

115844 A1373062 Hs.332938 hypothetical protein MGC5370 4.4 

115866 AW062629 Hs.52081 K1AA0867 protein 7.2 

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 1.2 

40 115941 AI867451 Hs.46679 hypothetical protein FU20739 5.5 

115968 AB037753 Hs.62767 KIAA1 332 protein 9.8 

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 14 

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 2.4 

116108 AA770688 Hs.28777 H2A histone family, memberL .18 

45 116134 BE243834 Hs.50441 CGI-04 protein 14 

116189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypothetical protein K1AA0884 [H.sapiens] 12 

116195 AW821113 Hs.72402 ESTs 2.1 

116238 AV660717 Hs.47144 DKFZP586N0819 protein 17 

116246 AF265555 Hs.250646 baculoviral IAP repeatcontaWng 6 17 

50 116262 AI936442 Hs.59838 hypothetical protein FU1 0808 17 

116298 AI955411 Hs.94109 Homo sapiens cDNA FU13634 fe, clone PLACE1011133 19 

116318 AF097645 Hs.58570 deleted to cancer 1; RNA heUcase HOB/DICE1 4.9 

116325 AM72106 Hs.49303 HornosapieiiscONAFU11663Cs,doneHEMBA1004631 14 

116336 AL133033 Hs.4084 WAA1025 protein 19 

55 116339 AK000290 Hs.44033 dipeptidyl peptidase 8 15 

116350 AA497129 Hs.184771 nuclear factor l/C (CCAAT-binding transcripton factor) 19 

116358 AI149586 Ks.38125 toterterorvinduced protein 75. 52kO 19 

116365 N50174 Hs.46765 ESTs 6.1 

116368 N90466 Hs.71109 WAA1229 protein 16 

60 116417 AW499664 Hs.12484 Human done 23826 mRNA sequence 7.4 

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 2.1 

116462 AF218313 Hs.236828 putative heGcase RUVBL 13 

116470 AI272141 Hs,83484 SRY (sex deteimlning region Y)-box 4 2.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y>box 4 12 

65 116575 AA312572 Hs.6241 pru>sphcmosIfide<Wdnase, regulatory subunit, pdypepflde 1 (p85 alpha) 15 

116637 AKD01043 Hs.92033 totegnn-finked Unase^asscclated serine/threonine phosphatase 2C 2.7 

116640 X89984 Hs.211563 B-cefl Cli/tymphoma 7A 23 
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116700 AI800202 Hs.317589 hypothetical protein MGC10765 1.4 

116705 AW074819 Hs.12313 hypothetical protein RJ14566 3.4 

116732 AW152225 Hs.165909 ESTs, Weakly sJmflarto 138022 hypothetical protein IRsapiens] 2.9 

116921 AW068115 Hs.821 biglycan 8.3 

5 116926 H73608 Hs.290830 ESTs 1.7 

117034 U72209 Hs.180324 YY1 -associated factor 2 3.4 

117132 AI393666 Hs.42315 p10-bindlng protein 5.2 
117247 N21032 gbryx46fl)6.s1 Scares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:264803 3\ mRNA sequence, 5.5 

117276 N711B3 Hs.121806 Homo sapiens cDNA RJ11971 fis, clone HEMBB 1001208 1J5 
1 0 1 17284 AK001701 Hs.183779 Homo sapiens cONA RJ10590 fis, clone NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0 

117367 AI041793 Hs.42502 ESTs 2.0 

117368 AI878942 Hs.90336 ATPase.H* transporting, lysosomal (vacuolar proton pump),memberJ 2.1 
117382 AF150275 Hs.40173 ESTs 27 
117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic acid transporters), member 6 1.4 

15 117557 AF123050 Hs.44532 tfubiqultin 3.4 

117588 N34895 Hs.44648 ESTs 3.4 

117745 BE294925 Hs.46680 COM2 protein 3.0 

117754 AA121673 Hs.59757 zinc finger protein 281 1.9 

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 1.8 

20 117881 AF161470 Hs.260622 butyrate^duced transcript 1 5.7 

117904 BE540675 Hs.332938 hypometical protein MGC5370 ' 5.9 

117911 AL137379 Hs.47125 hypotfieficaJ protein FU13912 1.7 

117933 Y10518 Hs.116470 hypometical protein FU20048 1.7 

117983 AL1 10246 Hs.47367 KIAA1785 protein 5.4 

25 118078 N54321 Hs.47790 EST 5.2 

118301 AA453902 Hs.293264 ESTs 2.6 

118429 AA243332 Hs.74649 cytochrome c oxidase subunK Vic 2.5 

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.1 

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) 1.2 

30 118509 N22617 Hs.43228 Homo sapiens cDNA FU11835 lis, done HEMBA1006595 15 

118528 A1949952 Hs.49397 ESTs 7.4 

118656 A1458020 Hs.293287 ESTs Z5 

118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 0isapiensl 1.2 

118698 AB033113 Hs.50187 KIAA1287 protein 2.1 

3 5 1 18737 AA199686 gb:zq75g09.ii Stratagene hNT neuron (937233) Homo sapiens cDNA done IMAGE:647488 5' 5.2 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4 
118984 AI668709 Hs.240722 ESTs, Moderately similar to AWSJttJMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6 

118986 AF148713 Hs. 125830 bladder cancer overexpressed protein 4.8 

119206 W24781 Hs.293798 KIAA1710 protein 1.7 

40 119235 AW453069 Hs.3657 activity-dependent neuroprotective protein 2.2 

119235 AW453069 Hs.3657 activity-dependent neuroprotective prate 1.6 

119265 BE539706 Hs.285363 ESTs * 1.4 

119279 N57568 Hs.48028 EST 25.1 

119298 NM.001241HS.155478 cyciinT2 1.6 

45 119338 A1417240 Hs.320836 ESTs, Weakly similar to A47582 B-cefl growth factor precursor [Ksapiens] U 

119349 T65004 Hs.163561 ESTs 8.4 

119403 AU17554 Hs.119908 nucleolar protein NOP5/NOP58 6.7 

119478 A1624342 Hs.170042 ESTs 2.4 

119486 AI796730 Hs.55513 ESTs 2.1 

50 119513 W37933 Empirically selected from AFFX single probeset 1.9 

119601 AK000155 Hs.91684 Honiosapieris mRNA; cDNAOKFZp667l 103 (from clone OKFZp667l103) 3.7 

119602 AW675298 Hs.233694 hypothetical protein RJ1 1350 3.0 
119676 AA243837 Hs.57787 ESTs 1.4 
119682 W61019 Hs.57811 ESTs 1.2 

55 119774 AB032977 Hs.6298 KIAA1 151 protein 1.8 

119780 NM.016625HS.191381 hypothetical protein 3.1 

119789 BE393948 Hs,50915 kailikrein 5 (KLK5; KLK-L2; stratum comeum trypfic enzyme) 9.2 
119805 AJ223810 Hs.43213 ESTs, Weakly similar to IEFS HUMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 3.6 

119818 AA130970 Hs.58382 hypothetical proteinFUl 1101 2.5 

60 119863 AA081218 Hs.58608 Homo sapiens cDNA FU14206 fis. clone NT2RP3003157 2,7 

119905 AW449064 Hs.119571 collagen, type III. alpha 1 (Ehlers-Dantos syndrome type IV, autosomal dominant) 2.6 

119966 AA703129 Hs.58963 ESTs 2.7 

120132 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 1.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

65 120248 AI924294 Hs. 1 73259 uncharacterized bone marrow protein BM033 * 1,2 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120269 AW131940 Hs.104030 ESTs 9,6 
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120274 AA177051 gb:nc02a0ls1 NCLCGAPJV3 Homo sapiens cDNA done IMAGE:194 similar to contains AJu 4.6 

120280 AA190577 gbzp52g02.s1 Stratagene HeLa cefl s3 937216 Homo sapiens cONA done 3\ mRNA sequence ZO 

120296 AW99591 1 Hs.299883' hypothetical protein FU23399 1 .8 

120297 AA191384 Hs.104072 ESTs, WeaWy similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H^aplens] 15.2 
5 120324 AA195517 Hs.191643 ESTs 5.5 

120325 AA195651 Hs.104106 ESTs 6.4 

120327 AK000292 Hs.278732 hypothetical protein FU20285 16.1 

120336 N85785 Hs.181165 eukaryoflc translation elongation factor 1 alpha 1 2.9 

120342 AW45Q669 Hs.45Q68 hypothetical protein DKFZp434l143 5.7 

10 120345 AA210722 Hs.104158 ESTs 4.5 

120349 AW969481 H&55189 hypothetical protein 16.6 

120352 R06859 Hs.193172 ESTs, Weakly slmBar to I38022 hypothetical protein (H.sapiens] 5.0 

120356 AF000545 Hs.296433 putative purinergic receptor 28.1 

120371 AA219305 Hs.104196 EST 12.4 

15 120382 AA228026 Hs.38774 ESTs 4.0 

120383 AL109963 Hs.123122 FSH primary response (LRPR1 , rat) homotog 1 9.7 

120386 . AW969665 Hs.154848 hypothetical protein OKFZp434O0127 32.6 

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7..HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21 .7 
20 120396 AA134006 Hs.79306 eukaryolic translation tnifiation factor 4E 12.5 

120404 AB023230 Hs.96427 KIAA1013 protein 72 

120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (from clone DKFZp586F1323) 11.4 

120423 AA236453 Hs.16978 Homo sapiens cDNA: FU22822 fis, done KA1A3968 1.9 

120472 AI950087 gb.-wq05c02.x1 NCLCGAPJ<Id12 Homo sapiens cDNA done 3', mRNA sequence 19.4 

25 120473 AA251973 Hs.269988 ESTs 5.4 

120484 AA253170 Hs.96473 EST 10.4 

120504 AA256837 gbzr84d10.s1 SoaresJihHMPu_.S1 Homo sapiens cDNA done IMAGE.-682387 3*. mRNA sequence. 3.9 

. 120509 BE047718 Hs.96545 ESTs 9.4 

120520 AA258601 Hs.161731 EST . 2.4 

30 120535 BE350244 Hs.96547 ESTs 2.5 

120551 AA279160 Hs.111407 Homo sapiens, clone IMAGE3613029. mRNA, partial cds 5.2 

120570 AA280679 Hs^71445 ESTs, Weakly similar to ALU1.HUWAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 

120582 BE244830 Hs.284228 ZNF135-fike protein 10.2 

120590 AW372799 Hs.125790 leudne-rich repeatcontaining 2 2.1 

35 120596 AA282074 Hs.237323.N-acetylglucosamlne-phosphatemutase 7.5 

' 120619 AW965339 Hs.111471 ESTs 2.5 

120624 AW407987 Hs.173518 M-phase phosphoprotein homoiog 52.0 

120639 AA286942 gb:zs56f05.s1 NCLCGAPJ3CB1 Homo sapiens cONA clone IMAGE:701505 3' similar to contains Alu2.4 

120648 AA287095 Hs. 140309 Homo sapiens, done 1MAGE13677194, mRNA, partial cds 5.0 

40 120653 AW063659 Hs.191649 ESTs 22 

120668 AW969638 Hs.112318 6.2 kd protein 2.2 

120669 BE536739 Hs.109909 ESTs 1.9 
120695 AA976503 gb:oq30aQ4.s1 NCLCGAP_GC4 Homo sapiens cDNA done 3* similar to contains PTR7.H PTR7 46.8 
120695 AI821539 Hs.97249 ESTs 2.5 

45 120713 AW449855 Hs.96557 Homo sapiens cDNA FU 12727 fis, done NT2RP200Q027 5.9 

120718 AA292747 Hs.97296 ESTs 2.9 

120750 AH91410 Hs.96693 ESTs, Moderately similar to 2109260A B ceU growth factor [H.sapiens] 7.0 

120774 AI608909 Hs.193985 ESTs 7.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; WAA1848 protein 6.8 

5 0 120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cONA 3* end similar to EST containing O family repeat, 4.4 

120938 AA386260 Hs.104632 EST 4.4 

120977 AA398155 Hs,97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.5 

120985 AI219896 Hs.97592 ESTs 1.2 
55 121011 AA398360 Hs.97608 EST 3.1 

121026 AM39713 Hs.165295 ESTs 3.5 

121081 AA398721 Hs.186749 ESTs, Highly similar to 137550 mismatch repair protein MSH2 [H^aptens] 5.4 

121133 AA363307 Hs.97032 ESTs 3.7 

121176 AL121523 Hs.97774 ESTs 17 

60 121223 AI002110 Hs.97169 ESTs, Weakly similar to (U667H1 2.2.1 [Helens] 2.9 

121320 AA403008 Hs.301927 c6.1A 1.9 

121340 AW956981 Hs.97910 Homo sapiens cDNA FU13383 fis. done PLACE1001024 3.5 

121408 AA406137 Hs.98019 EST 6.0 

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell growfo factor precursor (H.sapiens] 7.4 

65 121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, complete cds 6.9 

121452 AW971063 Hs.292882 ESTs 1.8 

121455 H58306 Hs.15165 retinoic add induced 14 10.5 
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121457 W07404 Hs.144502 hypothetical protein FU22055 3.4 
121496 AA442224 Hs.97900 ESTs 14.4 
121505 M494172 Hs.194417 ESTs 13.1 
121508 AA402515 Hs.97887 ESTs 28.0 
5 121513 AA416653 Hs.181510 ESTs 6.2 
121514 M412112 gb:zt69b02.s1 SoaresJesJisJIHTHomo sapiens cDNA clone IMAGE727563 3\ mRNA sequence. 2.6 

121549 AA412477 Hs.98142 EST 7,4 
121558 M412497 gb:zt95g1£s1 SoaresJesti$_NHT Homo sapiens cDNA clone IMAGE:73Q150 3' similar to contains2.8 

121577 AA411970 Hs.98096 EST 3.5 

10 121581 AA416568 gb:zu05c10.s1 SoaresJestisJJHT Homo sapiens cDNA done 3>RNA sequence 6.1 

121589 AO001528 Hs.89718 spermine synthase 39 

121594 AA626010 Hs.98247 ESTs 22 

121622 M416931 Hs.126065 ESTs 42 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (from done DKFZp434B1023) 7.8 

15 121682 AA418160 Hs.86043 Homo sapiens cOMA FU13558 lis, done PLACE 1007743 20 

121690 AV660305 Hs.110286 ESTs 47 
121706 U55184 Hs.154145 hypothetical protein FU11585 • 12.7 

121714 AA419225 Hs.98269 Homo sapiens cDNA FU1 1953 fc, done HEMB81000883 8.1 

121729 AI949597 Hs.98325 ESTs 1.8 

20 121731 AA421041 Hs.180744 ESTs 4.0 

121744 AA398784 Hs.97514 ESTs 7.1 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 195 

121773 AB033022 Hs.158654 KIAA1 196 protein 7.9 

121775 AA421773 Hs.161008 ESTs 1 7 

25 121776 AA292579 Hs.125133 hypothetical protein FU22501 6.6 

121786 AI810774 Hs.98376 ESTs 10.5 

121832 AW340797 Hs.98434 ESTs 5 8 

121836 AA328348 Hs.218289 ESTs 3 8 

^ 121839 AA425691 Hs.191606 ESTs, Highly similar to K1AA1 048 protein [H.sapiens] 5.0 

30 121842 AF027406 Hs.104865 serine/threonine kinase 23 2.7 

121847 AA446628 Hs.2799 cartilage linking protein 1 2.3 

121871 AW972668 Hs.293044 ESTs 2.9 

121882 AA426376 Hs.98459 ESTs 5.0 
121911 AA427950 gb:zw50f02.s1 Soare$JotatfetusJto2HF8_9w Homo sapiens cDNA done IMAGE:773499 3' 72 

35 121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 2.5 

121935 AA428647 Hs.98611 EST 2.3 

121983 AA298760 Hs.180191 hypothetical protein FLJ14904 3.4 

121985 AI862570 Hs.299214 Homo sapiens, clone IMAGE:2822295, mRNA, partial cds 11.4 

121995 AA210863 Hs.3532 nemc-like kinase 3.8 

40 121999 AA430211 Hs.98668 EST 64 

122009 AW292763 Hs.160822 Homo sapiens cDNA: FU20863 lis, done ADKA01804 2.2 

122013 AA431085 Hs.98706 ESTs 6.5 
122036 W92142 Hs.271963 ESTs. Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13.1 

122050 AI453076 Hs.166109 ELAV (embryonic lethal abnormal vision, DrosophilaHike 2 9.1 

45 122060 AA431738 Hs.98750 EST 13.1 

122114 AW161023 Hs.104921 ESTs 1.5 

122188 AA398838 gb:2t80d01 .rl SoaresjBStisJJHT Homo sapiens cDNA done 5\ mRNA sequence 3.3 

122204 AA435936 Hs.98842 EST 56 

122246 AA329550 Hs.29417 HCF-bindbig transcripfen factor Zhangfei 5J 

50 122257 AA436819 Hs.98899 ESTs 5.6 

122302 AA441801 Hs.104947 ESTs 5.8 

122341 AW601969 Hs.99010 hypoftetical protein FU22263 similar to nutfearreceptorWingSET-iiornain protein 1 2.0 

122356 AA443794 Hs.98390 ESTs 73 

122369 AA443985 Hs.303222 ESTs 122 

55 122371 AA868555 Hs.178222 ESTs 50 

122372 AA446008 Hs.336677 EST 76 

122378 AB032948 Hs^1356 hypothetical protein DKF2p762K2015 25 

122405 AA446572 Hs.303223 EST 2.8 

122412 AA446869 Hs.119316 ESTs 7.3 

60 122415 AA446918 Hs.99088 EST 1.9 

122418 AA446966 Hs.99090 ESTs, Moderately similar to similar to KIAA0766 {Rsapiens] 6.8 

122440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from done DKFZp547C244) 2.6 

122446 AA447603 Hs,99123 EST 1.8 

122448 AA447626 Hs.99127 EST 35 

65 122458 AI266159 Hs.104980 ESTs 1.5 

122460 AW418788 Hs.99148 ESTs. Weakly similar to S43569 R01H10.6 protein - Caenorhabditis elegans IC.elegans] 9.7 

122464 AA448158 Hs.99152 EST 4.8 
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122490 AA448349 Hs.238151 EST 6.1 

122492 AA448417 Hs.104990 ESTs 5.4 

122502 AA204969 Hs.234863 Homo sapiens cDNA FLJ12082 Cs, clone HEMBB1Q02492 1.3 

122510 AA449232 Hs.99195 ESTs 11.2 

5 122530 AW959741 Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 10.1 

122547 AA779725 Hs.164589 ESTs ' 2.5 * 

122555 AA194055 Hs.293858 ESTs 1.9 

122570 AA452578 Hs.262907 ESTs 9.5 

122572 AA452601 Hs.99287 EST 11.0 

10 122586 AK001910 Hs.99303 Homo sapiens cDNA FU 11048 fis, clone PLACE 1004516 3.4 

122587 AB040893 Hs.6968 K1AA1460 protein 2.0 

122598 AI028173 Hs.99329 ESTs 1.7 

122599 AL355841 Hs.99330 hypothetical protein RJ23588 4.4 
122602 AA411925 HsJH)1960 ESTs 4.6 

15 122607 AA453518 Hs.98023 ESTs 615 

122614 AA453630 Hs.99339 EST 10.7 

122616 AA453638 Hs.161873 ESTs 107.3 

122617 A1681535 Hs.148135 serine/threonine kinase 33 121.4 

122618 AA453641 gb:zx48e06.s1 SoaresJestisJIHT Homo sapiens cDNA done 3\ mRNAsequence 31.1 
20 122622 AA453987 Hs.144802 ESTs 5.6 

122717 AA456859 Hs.178358 ESTs 8.5 

122762 AI376875 Hs.105119 ESTs 10.4 

122829 AW204530 Hs.99500 ESTs 81.8 

122834 AA461492 Hs.99545 Homo sapiens cDNA FU10658 fis, done NT2RP2006052 3.6 

25 122836 AA460581 Hs^9Q996 ESTs 4.5 

122837 AA461509 Hs.293565 ESTs, Weakly similar to putative p150 (H.saplens] 2.7 

122838 AA460584 Hs.334386 ESTs 75.3 
122854 AA600235 Hs.9625 NIMA (never in mitosis gene aj-related kinase 6 7.7 
122856 AI929374 Hs.75367 Src-fike-adapter 5.8 

30 122861 AA335721 Hs.1 19394 ESTs 1-3 

122866 BE539656 Hs.283705 ESTs 4.1 

122868 AF0O5216 Hs.1 15541 Janus kinase 2 (a protein tyrosine kinase) 5.3 

122870 AW576312 Hs.318722 Homo sapiens cDNA: FU21766 fis, clone COLF7179 9.9 

122872 AW081394 Hs.97103 ESTs 5.3 

35 122879 AA769410 Hs.128654 ESTs 13.9 

122907 AA470074 Hs.169896 ESTs 11-5 

122916 AA470140 Hs.229170 EST 1-7 

122981 AA478951 Hs.105629 ESTs 5.0 

123013 AW968324 Hs.17384 ESTs 15.4 

40 123016 AW338067 Hs.323231 Homo sapiens cDNA FU11946 fis, clone HEMBB1000709 2.8 

123034 AL359571 Hs.44054 ninein (GSK3B interacting protein) 8.7 

123072 A1382500 Hs.104308 ESTs, Weakly similar to KIAA1395 protein (H^apiens] 8.8 

123082 AA485360 Hs.105661 ESTs 3.9 

123088 AI343652 Hs.105667 ESTs 3.8 

45 123110 AA486256 Hs.193510 EST 7.4 

123114 BE304942 Hs.265848 myomegaEn 2.8 

123131 T52027 Hs.271795 ESTs, Weakly similar to 138022 hypothetical protein [H.saplens] 2.4 

123132 AI061582 Hs.324179 Homo sapiens cDNA FU12371 fis, done MAMMA1002434 15.6 
123136 AW451999 Hs.194024 ESTs 5.1 

50 123149 AI734179 Hs.105676 ESTs 23.8 

123152 AW601773 Hs.270259 ESTs 5.2 
123258 AA490929 Hs.105274 ESTs. Weakly similar to RMS1 JiUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 9.3 
123315 AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done !MAGE:755827 3* similar to4.1 

123369 AA504767 Hs.105738 ESTs 6.9 

55 123394 AA731404 Hs.105510 ESTs 3.6 

123433 AW450922 Hs.112478 ESTs 3.7 

123466 AA599042 Hs.112503 EST 7.4 

123470 AW303285 Hs.303632 Human DNA sequence from done RP1 1-1 10H4 on chromosome 5 Contains a pseudogene similar to 3.5 

123471 AB021644 Hs.197219 zinc finger protein 14 (KOX 6) 5.2 
60 123475 BE439553 Hs.250528 Homo sapiens, done 1MAGE:4098694, mRNA, partial cds 1.7 

123482 N95059 Hs.55098 ESTs 1.6 

123486 BE019072 Hs.334802 Homo sapiens cDNA RJ14680 fis, clone NT2RP2004242, weakly similar to 2.4 

123508 AW380388 Hs.155546 KIAA1080 protein; GofeKassodated. gamma-adaptin ear containing, ARF-b'mding protein 2 2.2 

123615 AA609170 gbaf12a12.s1 Soares testis JtHT Homo sapiens cDNA clone J t mRNA sequence 7.8 

65 123619 AA602964 gb:no97c02.s1 NCLCGAP _Pr2 Homo sapiens cDNA done, mRNA sequence 2.8 

123658 AA609364 gbaffldDtel Scares testis JJKT Homo sapiens cONA done IMAGE:743441 3* similar to contains Alu. 1.7 

123674 A1269609 Hs.105187 kinesin protein 9 gene 5.7 
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123735 NM_013241Hs.95231 FH1/FH2 domain-containing protein 
123738 AA609891 Hs.1 \Zm EST 
123753 AA609955 Hs.234961 Huntingtin interacting protein E 

123804 AA620464 Hs.261915 EST, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 (Rsapiens] 
5 123811 AA6205B6 gb:ae60g05.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone iMAGE:951320 3' 

123951 AB012922 Hs.173043 metastasis-associated 1-Hke 1 

123983 AJ272267 Hs.146178 choline dehydrogenase 

124001 L42542 Hs.75447 ralA binding protein 1 

124006 AI147155 Hs.270016 ESTs 
10 124070 AI950314 Hs.1 54762 HIV-1 rev binding protein 2 

124074 H05635 Hs.294030 topolsonrerase-related function protein 4-2 

124178 BE463721 Hs.97101 putative G protein-coupled receptor 

124203 AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC093 (H-sapiens] 

124352 AA640891 Hs.102406 ESTs 
15 124375 D87454 Hs.192966 KIAA0265 protein 

124385 AI267847 gbaq49a10.x1 Stanley Frontal NB pool 2 Homo sapiens cDNA done simitar to contains 

124390 AA317338 Hs.7535 COBW-fike protein 

124391 AF155099 Hs.279780 NY-REN-18 antigen 

124417 N34059 gb:yv28h09.s1 Soares fey fiver spleen 1NFLS Homo sapiens cONA clone 3' similar to contains Alu 3.3 

20 124428 H13540 Hs.82202 ribosomal proteh L17 2.9 
124440 AA532519 Hs,129043 Human DNAsequence from done 989H11 on chromosome 22q1 3. 1-13.2. Contains part of a 
124466 R10084 Hs.1 13319 Wnesin heavy chain member 2 

124482 N53935 gb:yv59dQ9.s1 Soares fetal Bverspleen 1NRS Homo sapiens cDNA done 3'. mRNA sequence 

124498 H79433 Hs^68997 ESTs 
25 124515 AA669097 Hs.109370 ESTs 

124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7c*!TP (H^apiensl 
124631 NM_014053Hs.270594 RVCR protein 

124634 AI765123 Hs.143671 Homo sapiens cDNA FU13533 fe, done PLACE1006371 
124637 AA160474 Hs.75798 hypoftefical proiefo 
30 124642 AW968856 Hs.278569 sorting nexin 17 
124649 N92593 Hs.313054 ESTs 
124656 AW297702 Hs.102915 ESTs 
124661 R48170 Hs.78436 EphB1 

124683 AA381661 Hs.119878 ESTs, Weakly similar to M3K9_HUMAN MITOGEN-ACT1VATED PROTEIN KINASE KINASE 
35 124712 R09166 Hs.191148 ESTs 
124735 R22952 Hs.268685 ESTs 

124761 AA374756 Hs.93560 Homo sapiens mRNA for K1AA1771 protein, partial cds 
124768 AW368528 Hs.100855 ESTs 
124775 R41772 Hs.100878 ESTs 
40 124777 R41933 Hs.140237 ESTs, Weakly similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE 
124788 R43543 Hs.100912 Homo sapiens cDNA: FU22726 fis, done HSI15005 
124809 AL355722 Hs.106875 Homo sapiens EST from done 35214, full insert 

124811 R46068 Hs.288912 hypothetical protein FU22604 

124812 R47948 Hs.188732 ESTs 
45 124822 AA418160 Hs.86043 Homo sapiens cONAFU13558fc. done PLACE1007743 

124825 AA501669 Hs.336693 ESTs 
124833 AW975868 Hs.294100 ESTs 
124857 R63652 Hs.137190 ESTs 
124860 R65763 Hs.101477 EST 
50 124863 AI382555 Hs.127950 bromodofnain-containing 1 

124876 AF135422 Hs.27059 GDP-mannose pyrophosphorylase A 
124878 BE397530 Hs.288057 hypothetical protein FLJ22242 

124902 H37941 Hs.101883 ESTs 

124903 AW296713 Hs.221441 ESTs 
55 124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN Hit ALU CLASS B WARNING ENTRY ill [Ksapiens] 

124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein (H^apiens] 
124958 A1078645 Hs.431 rrwrineteukemia\^l(bmi-1)oncogerwhornolog 
124^80 T40841 Hs 98681 ESTs 

125002 T59338 Hs!269463 ESTs. Weakly slmOarto ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
60 125047 T79815 Hs.279793 ESTs 
125051 T79956 Hs.100588 EST 
125056 T81310 Hs.100592 ESTs 
125101 AM72068 Hs.286236 K1AA1 856 protein 

125113 T96595 Hs.302270 ESTs, Weakly similarto ALUFJHUMAN Hi! ALU CLASS F WARNING ENTRY Ul [Rsapiens] 
65 125115 T97341 gb:ye57e05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE: 121 856 3' similarto 

125125 A1222382 Hs.240767 Human DNAsequence from done RP1-12G14cflcrtfomos()me6q24.1-25Z Contains the 5 1 end of the gene 
125147 W38150 Empirically selected from AFFX single probeset 1.7 
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125161 W44657 Hs.144232 EST 10.7 
125249 AA630863 Hs.131375 ESTs, Moderately similar to ALUBJJUMAN Ml ALU CLASS B WARNING ENTRY ll! [H^apiens] 1.3 

125255 AF098162 Hs.1 18631 fimeless (Drosophila) homotog 9.4 

125279 AW4018Q9 Hs.4779 WAA1150 protein 1.5 

5 125280 AI123705 Hs.106932 ESTs 8.0 

125298 AW972542 Hs.28900B Homo sapiens cONA: FU21814 fis. clone HEP01068 1.5 

125660 AW292171 Hs23978 scaffold attachment factor B 5.9 

125827 NMJM3403HS.97496 YY1 transcription factor U 

125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 6.4 

10 126005 AW409701 Hs.1578 baculovira) IAP repeat-containing 5 (survwin) 143 

126202 AA157632 Hs.272630 vacuolar proton pump delta polypeptide 2.4 

126695 AA643322 Hs.172028 a disintegrin and metaJloproteinase domain 10 9.1 

127050 AW411066 Hs.274351 CGW9 protein 17.0 

127274 AW966158 Hs.58582 Homo sapiens cONA FU12789 fis, done NT2RP2001947 12,8 

15 128355 AW293012 Hs.161623 ESTs 7.3 

128493 D87466 Hs.240112 KIAA0276 protein 3.1 

128493 D87466 Hs.240112 KIAA0276 protein 1.3 

126522 BE173977 Hs.10098 putative nucleolar RNA hefcase 9.4 

128527 AA504583 Hs.101047 transcription factor 3 (E2Aln^urK^lobulinenr^cerbindirg factors E12/E47) 1.5 

20 128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR14 (ftsapiens] 2.8 

128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member 12.1 

128599 NM_015366Hs.102336 Rho GTPase activating protein 8 2.3 

128604 AI879099 Hs.102397 GIOT^forgoriatofropinirK^ 1.3 

128608 BE267994 Hs.102419 zinc finger protein 7.1 

25 128625 AB037841 Hs.102652 hypothetical protein ASH1 1.3 

128629 AL096748 Hs.102708 DKFZP434A043 protein 3.2 

128639 AW582962 Hs.102897 CGW7 protein 2.0 

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon 1.4 

128656 M458542 Hs.10326 coatomer protein complex, subunit epsilo 1.3 

30 128658 6E397354 Hs.324830 dfptheria toxin resistance protein required for dphthamide biosynthesis (Saccharomyces)-like 2 24 
128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cONA 1700010L19 gene, done MGC:16214, mRNA, complete cds 7.1 

128691 W27939. Hs.103834 hypothetical protein MGC5576 7.7 

128698 BE081143 Hs.225977 nuclear receptor coadivator 3 3.8 

128700 Y15221 Hs.103982 small inducible cytokine subfamily B (Cys-X-Cys), member 1 1 1.6 

35 128714 T85231 Hs.1 79661 tubulin, beta 5 7.6 

128717 AKD01564 Hs.104222 hypoflietical protein FU10702 5.5 

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypolheflcal protein (H-sapiens) 2.7 

128737 AF292100 Hs.104613 RP42homolog 2.8 

128742 AA307211 Hs.251531 proteasome (prosome, macropain) subunit alpha type, 4 4.4 

40 128746 A1470163 Hs.323342 acfin related protein 2/3 complex, subunit 4 (20 kD) 22 

128747 AB027249 Hs.104741 PDZ-oindlng kinase; T-cell originated protein kinase 2.8 

128772 BE302796 Hs.105097 thymidine kinase 1, soluble 5.3 

128781 N71826 Hs.105465 small nuclear ribonudeoprotein polypeptide F 53.9 

128797 NM_002975Hs.105927 stem cell growth factor; lymphocyte secreted C-type lectin 13.3 

45 128806 AW630942 Hs.106061 RD RNA-blnding protein . 2.6 

128814 AW248431 Hs.256526 nuclear prelamln A recognition factor 2.2 

128830 BE281170 Hs.106357 vatosirwxmtaining protein 5.9 

128835 AKD01731 Hs.106390 Homo sapiens mRNA; cONA OKFZp586H0924 (from clone DKFZp586H0924) 1.6 

128854 BE159181 Hs.168232 hypothetical protein FU1 3855 2.2 

50 128854 BE159181 Hs.168232 hypothetical protein FLJ1 3855 1.9 

128868 AA419008 Hs. 106730 chromosome 22 open reading frame 3 3.0 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 22 

128871 AF189723 Hs.1 06778 ATPase l Ca++transpor8ng, type 2C, member 1 1.5 

128891 P34856 Hs.292457 Homo sapiens, done MGC:16362, mRNA, complete cds 13.3 

55 128906 R57988 Hs.10706 epithelial protein lost In neoplasm beta 4.7 

128920 AA622037 Hs. 166468 programmed ce)J death 5 1.4 

128925 R67419 Hs.21851 Homo sapiens cONA FU12900 lis, clone MT2RP2004321 1.9 

128946 Y13153 Hs.107318 kynurenine 3-rnonooxygenase (kynurenlne 3-hydroxylase) 7.2 

128949 AA009647 Hs.8850 a dislntegrin and metaHoproteinase domain 12 (meltrin alpha) (ADAM-12) 2.4 

60 128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 1.3 

128959 AI580127 Hs.107381 hypothetical protein FU112Q0 10.9 

128965 AW150697 Hs.107418 ESTs 1.4 

128970 AI375672 Hs.165028 ESTs 1.3 

128975 BE560779 Hs.284233 NJCE^S protein 14.0 

65 128979 AW271217 Hs.281434 Homo sapiens cONA FU14028 fis. done HEMBA1003838 1.6 

128995 A1816224 Hs.107747 DKF2P566C243 protein 1.9 

129019 AI950087 gb:wqO5c02jc1 NCLCGAP_Kid12 Homo sapiens cONA clone 3', mRNA sequence 25 
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129021 AL044675 Hs.173081 K(AA0530 protein 3.8 

129021 AL044675 Hs.173081 KIAA0530 protein 2.5 

129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 3,4 

129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypothetical protein DKFZp434M2023.1 [H.sapiens] 5.0 

5 129078 AI351010 Hs.102267 lysosomal Z1 

129088 AA744610 Hs.194431 pafedin 17.1 

129095 112350 Hs.108623 thrombospondin 2 2.7 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 3.0 
10 129099 AF146074 Hs.108660 ATP-binding cassette, sub-family C (CFTR/MRP). member 5 5.8 

129135 W93048 Hs.250723 hypometlcal protein MGC2747 5.9 

129149 AA356620 Hs.108947 KIAA0050 gene product 6.3 

129172 AW162916 Hs.241576 hypotheticai protein PR02577 1.8 

129192 AA286914 Hs.183299 ESTs 2.1 

15 129194 AA150797 Hs.109276 latexin protein 3.2 

129198 N57532 Hs,109315 KIAA1415 protein 5.8 

129207 A1934365 Hs. 1 09439 osteogiycin (osteoinductive factor, mimecan) 8.0 

129228 U40714 Hs.239307 tyrosyi-tRNA synthetase 2.9 

129229 AF013758 Hs.109643 poJyadenyiate binding protein-interacting protein 1 3.2 
20 129254 AA252468 Hs.1098 DKFZp434J 1813 protein 2.6 

129255 AI961727 Hs.109804 H1 histone family, member X 73 

129288 W26392 Hs.1 10080 ESTs, Weakly similar to S13495 pregnancy zone protein [H^apiens] 9.6 

129296 AI051967 Hs.1 10122 ESTs 1.2 

129323 AA287239 Hs.5518 Homo sapiens cDNAFUl 131 1 fis, clone PLACE 101 01 02 5.1 

25 129340 H75334 Hs,11050 F-box only protein 9 4.6 

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes 7.6 

129362 U30246 Hs.1 10736 solute carrier family 12 (sodiurr^potasslum/chtoride transporters), member 2 6.7 

129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 8.6 

129370 A1686379 Hs. 11 0796 SAR1 protein 1.4 

30 129372 NM_016039Hs.1 10803 CGI-99 protein 2.0 

129403 AF149785 Hs.111126 pituitary tumortrarurforming 1 interacting protein 7.4 

129404 AI267700 Hs.317584 ESTs 5.0 
129404 AI267700 Hs.317584 ESTs 2.5 
129423 AA204686 Hs.234149 hypothetical protein FU20647 10.2 

35 129449 AI096988 Hs,111554 ADP^bosylation factor-ifoe 7 8.0 

129453 AW974265 Hs.1 11632 Lsm3 protein 3.2 

129482 AA188185 Hs.289043 spindlin 6.7 

129482 AA188185 Hs.289043 spindlin 3.6 

129513 AW843633 Hs.306163 hypothetical protein AL1 101 15 7.1 

40 129515 AF255303 Hs.1 12227 niembran*assoc!ated nucleic acid binding protein Z5 

129527 AA769221 Hs.270847 delta-tubutin 3.2 

129559 W01296 Hs.1 1360 hypometical protein FU14784 7.5 

129560 AA317841 Hs.7845 . hypothetical protein MGC2752 6.8 
129570 AJ923097 Hs.1 1441 chromosome 1 open reading frame 8 2.0 

45 129575 F08282 Hs.276428 progestin Induced protein 1.6 

129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6.8 

129588 BE408300 Hs.301862 postmeiotfc segregation increased 24ike 9 1.4 
129591 N57423 Hs.179898 HSPC055 protein 7.3 
129594 AW403724 Hs.36989 coagulation factor VII (serum promroinbm wnversbn accelerator) 9.0 

50 129596 AF035537 Hs.115521 REV3 (yeast homotog)-lite, catalytic subunit of DNA polymerase zeta 1.6 

129628 U38945 Hs.1 174 cycDn-dependent kinase inhibitor 2A (melanoma. p16. inhibits CDK4) 2.2 

129628 U38945 Hs.1 174 cycflrwiependent kinase Inhibitor 2A (me 1.4 

129629 AK000398 Hs.1 1747 hypothetical protein FU20391 3.8 
129649 AD000092 Hs.16488 calreficulin 3.3 

55 129675 NM_015556Hs.172180 KIAA0440 protein 13.4 

129680 U03749 gb:Human chromogranln A (CHGA) gene, promoter an 14.1 

129689 AW748482 Hs.77873 B7homotog3 Z6 

129702 AI304966 Hs.12035 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 7.4 

129720 AA156214 Hs.12152 APMCF1 protein 2.0 

60 129721 NIVL001415HS.211539 eukaryotic translation Wtiation factor 2, subunit 3 (gamma. 52kO) 1.7 

129726 H15474 Hs.132698 fatty acid desaturase 1 8.3 

129778 AK001676 Hs.12457 hypothetical protein FU10814 1.8 

129779 AA394090 Hs.12460 Homo sapiens done 23870 mRNA sequence 5.4 
129800 AF052112 Hs.12540 lysosomal 1.7 

65 129806 AB023148 Hs.173373 KIAA0931 protein U 

129815 BE565817 Hs.26498 hypothetical protein FU21657 3.1 

129840 NM_006590Hs.12820 SnRNP assembly defective 1 homolog 1.8 
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129861 AL049999 Hs.85963 0KFZP564M1 82 protein 2.2 

129864 A1393237 Hs.1 29914 runt-related transcription factor 1 (acute myeloid leukemia 1; am!1 oncogene) 1.7 

129869 AI222069 Hs.13015 hypothetical protein similar to mouse Onajll 2.7 

129922 AF042379 Hs.13386 gairtma-tubuEn complex protein 2 4.5 

5 129945 BE514376 Hs.165998 PAM mRNA-bindlng protein 1.8 

129953 AA412195 Hs.13740 ESTs ZS 

129972 AW753185 Hs.180628 dynamin 1-fika 1.6 

129983 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 1.3 

129989 AB015856 H&247433 activating transcription factor 6 4.0 

10 130010 AA301116 Hs.142838 nudeolarphosphoproteInNopp34 1.6 

130081 AA287325 Hs.14713 ESTs 4.0 

130082 S73265 Hs.1473 gastrirKeteasing peptide 1.6 
130097 AL046962 Hs.14845 f orWiead box 03A 2.8 
130100 AL135561 Hs.14891 hypothetical protein FU21047 2.3 

15 130111 X53002 Hs.149846 Integrin, beta 5 2.3 

130112 AA916785 Hs.1 80610 splicing factor prollne/glutamine li^ 3.0 

130112 AA916785 Hs.180610 splicing factor profinefclutamine rich ( 2.1 

130128 L76937 Hs.150477 Werner syndrome 1.8 

130135 AA311426 Hs.21635 tubulin, gamma 1 6.1 

20 130211 NMJXJ3358HS.23703 ESTs, Moderately simflax to CEGT_HUMAN CERAM1DE GLUCOSYLTRANSFERASE [H.saplens] 1.6 

130212 D80001 Hs.1 52629 KIAA01 79 protein 1.3 

130236 R85367 Hs.51957 splicing factor, arginine/serine-rich 2, interacting protein 2.0 

130241 AL035588 Hs.1 53203 MyoD family Inhibitor 3.2 

130242 X79201 Hs.1 53221 synovial sarcoma, translocated to X chromosome 5.4 
25 130249 D81983 Hs.322852 GAS2-retated on chromosome 22 4.8 

130263 NM.002497HS.153704 NIMA (never in mitosis gene ablated kinase 2 1.4 

130287 AA479005 Hs.154036 tumor suppressing subtransferable candidate 3 2.6 

130310 AB011121 Hs.154248 amyotrophic fateral sderosis 2 Quw^ 6.3 

130353 Z19064 Hs.172210 MUF1 protein 6.2 

30 130356 AF127577 Hs.1 55017 nuclear receptor Interacting protein 1 2.4 

130357 AJ224442 Hs.155020 putative methyftransferase 3.4 

130359 NMJ)13449Hs.277401 bromalomain adjacent to zJnc finger domain, 2A 8.5 

130367 AL135301 Hs.8768 hypothetical protein FU10849 1.4 

130372 AI077464 Hs.5011 RNA binding motif protein 9 3.3 

35 130393 N89467 Hs.155291 WAA0005 gene product 1.8 

130399 AW374106 Hs.155356 hypothetical protein MGC2840 similar to a putative glucosyltransferase 3.4 

130407 BE385099 Hs.334727 hypothetical protein MGC3017 2.3 

130409 NMJH)1197Hs.155419 B02*teradngldller(arjoptosis^ucing) 2.7 

130419 AF037448 Hs.155489 NSI-assodated protein 1 1.8 

40 130441 U63630 Hs. 1 55637 proteh Wr^, OrW^cthrated, catarytic polypepfide 2.3 

130448 BE513202 Hs.15589 PPAR binding protein 3.9 

130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acetyltransferase) 33.6 

130455 090041 Hs.155956 N-acetyttransferase 1 (arylamine N-acety 4.6 

130471 AL121438 Hs.183706 adducln 1 (alpha) 2.7 

45 130465 BE245851 Hs.180779 HBrustDnefamBy, member B 5.0 

130487 U49844 Hs.77613 ataxia telangiectasia and Rad3 related 4.3 

130498 138951 Hs.180446 karyopherln (Importin) beta 1 1.6 

130503 BE208491 Hs.295112 KIAA0618 gene product 16.1 

130511 L32137 Hs.1584 cartilage ongornericnratrixr^tefo(r#eud 6.1 

50 130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 5.3 

130526 AW876523 Hs.15929 hypoftellcal protein FU12910 Z1 

130542 U64675 Hs.179825 RAN binding protein 2-iike 1 7.8 

130544 AA321238 Hs.4310 eukaryotic translation Wtiation factor 1A 1.5 

130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4 

55 130556 AI907018 Hs.15977 Empirically selected from AFFX single probeset 4.7 

130567 AA383092 Hs.1608 replication protein A3 (14kD) 7.9 

130568 AA232119 Hs.16085 putative G-proteln coupled receptor 33 
130574 AF083208 Hs.16178 apoptosis antagonizing transcription factor 1.2 
130586 AB007891 Hs.16349 K1AA0431 protein 5.6 

60 130598 AL042210 Hs.16493 hypothetical protein DKFZp762N2316; K1AA1803 protein 1.4 

130601 AA609738 Hs.16525 ESTs 1.5 

130614 AI354355 Hs.16697 dowrHegutator of transcriptjon 1, TBP-bindlng (negative cofactor 2) U 

130617 M90516 Hs.1674 glutamlrie^ctose-6-phosphate transaminase 1 12.1 

130617 M90516 Hs.1674 glutamme^ctose-6-phosphate transamin Z4 

65 130618 AA383439 Hs.16758 Splr-1 protein 15.9 

130667 BE246961 Hs.17639 Homo sapiens uKquitm protein ligase (UBE3B) mRNA, partial cds 135 

130674 AL048842 Hs.194019 attractin 1.5 
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130675 AA442233 Hs.17731 hypothetical protein FU1 2892 5.4 

130692 AA652501 Hs.13561 hypothetical protein MGC4692 5.0 

130693 R68537 Hs.17962 ESTs 2.0 
130712 AJ271881 Hs.279762 bromodomain-contairung 7 1.8 

5 130714 AI348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 2.0 

130730 AB007920 Hs.18586 K1AA0451 gene product 3.7 

130744 H59696 Hs.W47 POP7 (processing of precursor, S. cerevisiae) horndog 3.1 

130751 AF052105 Hs.18879 tfromosome 12 open reading frame 1.4 

130757 AL036067 Hs.18925 protein x 0001 5.7 

10 130768 AF258627 Hs.211562 ATP-btnding cassette, sub-family A (ABC1), member 1 5.1 

130789 AKD00355 Hs.8899 sirtuln (silent mating type tnformation regulation 2, S.cerevisae,homolog) 5 52 

130815 A6018298 Hs.19822 SEC24 (S. cerevisiae) related gene femBy, member D 1.5 

130836 J05068 Hs.2012 transcobalamln I (vitamin B12 binding protein, R binder family) 15.7 

130841 AL157468 Hs.325825 Homo sapiens cDNA RJ20848 fis, clone AOKA01732 2.8 

15 130843 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 protein x 013 [H.sapiens] 1.5 

130844 U76248 Hs.20191 seven in absentia (Orosophila) homotog 2 3.4 

130855 AJ243706 Hs.143323 putative DNA/chromatin binding motif 1.7 

130861 NM.016578HS-20509 HBV pX associated protein-6 1,9 

130879 NIA.003416Hs.2076 zlncfingerprotein7(KOX4 l cloneHF.16) 1.4 

20 130880 BE514434 Hs.20830 Wnesin-6ke2 2.1 

130892 AL120837 Hs.20993 high-glucose-regulated protein 8 2.4 

130898 AB033078 Hs.186613 sphIngosine-1-phosphate lyase 1 1.7 

130911 BE409769 Hs.21189 DnaJfJtep40)homolog 1 subfarnilyA t menrjber2 1.8 

130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 2.3 

25 130944 BE382657 Hs.21486 signal transducer and activator of transcription 1, 91 kD 5.4 

130971 N39842 Hs.301444 KIAA1673 2.2 

130992 BE398Q91 Hs.74316 desmoplakln (DPI, DPI!) 1.8 

130993 T97401 Hs.21929 ESTs 1.6 
131005 AV658308 Hs.2210 thyroid hormone receptor interactor 3 * 1.6 

30 131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma 1.2 

131042 AI826288 Hs.171637 hypothetical protein MGC2628 1,6 

131046 AA321649 Hs.2248 smaD inducible cytokine subfamily B (Cys-X-Cys), member 10 7.4 

131046 AA321649 Hs.2248 small Inducible cytokine subfamily B (Cy 3.0 

131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-Iinked retinopathy protein [KsaplensJ 1.7 
35 131060 AA194422 Hs.22564 myosin VI 5.1 

131060 AA194422 Hs.22564 myosin VI 2.5 

131070 N53344 Hs.22607 ESTs 7.1 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosamine} N-acetylglucosaminephosphotransferase 1 ( 2.0 

131076 AA749230 Hs.26433 dofichyl-phosphate(UDP-N-acetylghicosam ** 1.9 

40 131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxidase assembly protein 7.0 

131174 NM_006540Hs.29131 nuclear receptor coacfivator 2 1.9 

131185 BE280O74 Hs.23960 cydinBI 5.8 

131206 AW138839 Hs.24210 ESTs 2.0 

131213 AA885699 Hs.24332 CGl-26 protein 7.0 

45 131225 H62087 Hs.31659 thyroid hormone receptor-associated protein, 95-kDsubunit 7.5 

131231 N47468 Hs.59757 zinc finger protein 281 19 

131233 D89053 Hs.268012 fatty-acid-Coenzyme A Rgase, long-chain 3 3.5 

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2.8 

131245 AL080080 Hs.24766 mtoredoxin domain-containing 2.6 

50 131247 AL043100 Hs.326190 fatty acid amide hydrolase 5.6 

131281 AA251716 Hs.25227 ESTs 5.7 

131283 X80038 Hs.339713 Homo sapiens done F19374 APO E-C2 gene cluster U 

131305 AV656017 Hs.184325 CGI-76 protein 5.0 

131320 AA505691 Hs.145696 splicing factor (CC1.3) 1.8 

55 131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6 

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nflwfn) 2.6 

131375 AW293165 Hs.143134 ESTs 5.4 

131390 BE269388 Hs.182698 mitochondrial ribosomal protein L20 5.3 

131410 BE259110 Hs.279836 HSPC166 protein 2,2 

60 131412 NM.012247HS.124027 SELENOPHOSPHATE SYNTHETASE ; Human selenium donor protein 2.0 

131429 AL046302 Hs.26750 hypothetical protein FU21 908 1.4 

131458 BE297567 Hs.27047 hypothetical protein FU20392 1,7 

131475 AA992841 Hs.27263 KIAA1458 protein 2.0 

131501 AV661958 Hs,8207 GK001 protein 2.6 

65 131501 AV661958 Hs.8207 GK001 protein 16 

131511 AA732153 Hs.27865 Homo sapiens cDNA: FU21333 fis, done COL02535 2.0 

131528 AU076408 Hs.28309 UDP-gluoose dehydrogenase 16 
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131532 BE268278 Hs.28393 hypothetical protein MGC2592 7.4 

131543 AW966881 Hs.41639 programmed cefl death 2 2.2 

131544 AL355715 Hs.28555 programmed cefl deatfi 9 (P0CD9) 2.1 
131562 NM-003512HS.28777 H2A histone family, member L 1.7 

5 131564 T93500 Hs.28792 Homo sapiens cONA HJ11041 fis. clone PLACE1004405 5.1 

131564 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 lis. clone PL 1.8 

131569 AL389951 Hs.271623 wdeoporfn 50kD 5.0 

131618 BE393822 Hs.29645 Homo sapiens mRNA; cDNA DKFZp761C029 (from clone DKFZp761C029); partial cds 1.8 

131622 R78195 Hs.29692 Homo sapiens cDNAFUl 1436 fis, done HEMBA1001213 1.3 
10 131623 AB037791 Hs.29716 hypoftefical protein FU10980 22 

131623 AB037791 Hs.29716 hypoftefical protein FU10980 1.9 
131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 
131653 AW960597 Hs.30164 ESTs 1.3 
131656 A1218918 Hs.30209 KIAA0854 protein 2.8 

15 131669 X52486 Hs.3041 uracii-DNA glycosybse 2 2.8 

131692 BE559681 Hs.30736 KIAA01 24 protein 5.6 

131714 AA642831 Hs.31016 putative DNA binding protein 2.9 

131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransferase 3.4 
131737 AK001641 Hs.31323 inhibitor of kappa fight polypeptide gene enhancer in Cecils, kinase complex-associated protein 3.8 

20 131760 X76732 Hs.3164 nudeobtodln 2 2.9 

131760 X76732 Hs.3164 nucleobindin 2 2.8 

131763 AI878932 Hs.317 topoisomerase (DNA) I 3.4 

131772 AA382590 Hs.170980 WAA0948 protein 255 

131774 BE267158 Hs.169474 DKFZP586J01 19 protein 55 

25 131787 087077 Hs.196275 K1AA0240 protein 2.4 

131793 AW966127 Hs.32246 Homo sapiens cDNA RJ14656 fis, done NT2RP2002439 7.9 

131795 BE501849 Hs.32317 Wgh-mobifity group 20B 1.4 

131798 X86098 Hs.301449 adenovirus 5 E1A binding protein 4.1 

131817 U20536 Hs.3280 caspase 6, apoptosls-reiated cysteine protease 4.2 

30 131824 U28838 Hs.32935 TATA box binding protein (TBP)-associated factor, RNA polymerase HI, GTF3B subunit 2 3.5 

131850 A1251317 Hs.33184 ESTs 5.1 

131878 AA083764 Hs.6101 hypothetical protein MGC3178 5.8 

131885 BE502341 Hs.3402 ESTs 13.7 

131885 BE502341 Hs.3402 ESTs 2.4 

35 131887 W17064 Hs.332848 SWl/SNF related, matrix associated, adm dependent regulator of chroma^ 3.2 

131900 AA099014 Hs.231029 Homo sapiere, ctone MGC:15961. rriRrM, complete cd^ 8,7 

131900 AA099014 Hs.231029 Homo sapiens, done MGC.i 5961, mRNA, com 2.0 

131904 AW78866 Hs.284296 Homo sapiens cDNA: FU22993 fis, done KAT1 1914 5.5 

131905 AA179298 Hs.3439 stomatln-fike 2 11.3 
40 131913 AW207440 Hs.185973 degenerative spermatocyte (homotog DrosophBa; Opid desaturase) 1.7 

131916 AA025976 Hs.34569 ESTs 5.2 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 1 1 (yeast APC1 1 homotog) 2.7 

131929 BE541211 Hs.34804 Homo sapiens cDNAFUl 1472 lis, done HEMBA1001711 5.3 

131941 BE252963 Hs.35086 ubiquitin specific protease 1 2.3 

45 131950 AA3551 13 Hs,35380 x 001 protein 1.5 

131962 AK000046 Hs.267448 hypothetical protein RJ20039 2.3 

131965 W79283 Hs.35962 ESTs 1.4 

131971 BE567100 Hs.154938 hypoSiefical protein MDS025 3.5 
131977 U90441 Hs3622 procc^gen-fjroBne, 2-oxoglutarata 4^oxygenase (prdine 44iydroxylase) l alpte polypeptide 6.5 

50 131985 AA503020 Hs.36563 hypotiietkai protein FU22418 2.4 

131991 AF053306 Hs.36708 budding uninhibited by berudmidazoies 1 (yeast homotog), beta 2.1 

132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 3.2 

132031 AF193844 Hs.375o COF3 complex subunit 7a 5.8 

132062 BE266155 Hs.3832 dathrirvassodated protein AP47 1.5 

55 132084 NM002267Hs^886 karyopherin alpha 3 fimportin alpha 4) 3.7 

132103 BE171921 Hs.3991 ESTs 1.4 

132105 AV646076 Hs.39959 ESTs 5.8 

132116 AW960474 Hs.40289 ESTs 1.7 

132176 AA857025 Hs.8878 kinesin-like 1 3.3 

60 132180 NM.004460Hs.418 fibroblast activation protein, alpha 14.7 

132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37 5.5 

132194 R42432 Hs.4212 ESTs 4.4 

132203 NM_004782Hs.194714 synaptosorna^assodated protein, 29kD 22 

132207 BE206939 Hs.42287 E2F bansedption factor 6 22 

65 132235 AV658411 Hs.42656 WAA1681 protein 7.8 

132240 AB018324 HS.42676 WAA0781 protein 1.5 

132252 AI566004 Hs.141269 Homo sapiens cDNA: FU21550 fis, done COUD6258 U 
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132266 AA301228 Hs.43299 hypothetical protein RJ1 2890 5.7 

132273 AA227710 Hs.43658 DKFZP586L151 protein 4.2 

132276 AA653507 Hs.285711 hypothetical protein FU13089 2.1 

132288 N36110 Hs.305971 solute carrier family 2 {facilitated glucose transporter), member 10 1.5 

5 132294 AB023191 Hs.44131 KIAA0974 protein 10.0 

132298 NNL015986H5.7120 cytokine receptor-like molecule 9 1.9 

132299 AW405882 Hs.44205 cortistatin 9.2 
132325 N37065 Hs.44856 hypoftefical protein RJ12116 2.0 
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein CMike 6.5 

10 132370 AW572805 Hs.46645 ESTs 3.8 

132374 AF155582 Hs.46744 corel UDP-g^ctosefl-acetylgalac^^ 1.5 

132376 A1279892 Hs.46801 soring nexin 14 12.5 

132384 AA312135 Hs.46967 HSPC034 protein 283 

132393 AL135094 Hs.47334 hypothetical protein FU 14495 1.9 

15 132450 AA100012 Hs.48827 hypothetical protein FU12085 1.9 

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 6.1 

132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 1.7 

132465 AW169847 Hs.49169 K1AA1634 protein 8.6 

132470 A1224456 Hs.4934 H.sapiens polyA site DNA 5.2 

20 132484 X16660 Hs. 11 9007 RAB4, member RAS oncogene family 1.4 

132518 AW885606 Hs.5064 ESTs 6.1 

132528 T78736 Hs.50758 SM(M (structural rnaintenarra ^ 3.3 

132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiaeHJke 1 2.0 

132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 2.9 

25 132534 BE388673 Hs.5086 hypothetical protefri MGC10433 2.2 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 7.3 

132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cerevisiae. homolog of 1.7 

132574 AW631437 Hs.5184 TH1 drosophfla homolog 7.1 

132596 AK001484 Hs.5298 CGI-45 protein 2.2 

30 132611 AA345547 Hs.53263 hypothetical protein FU13287 2.2 
132612 H12751 Hs.5327 PR01914 protein * 6.8 

132616 BE262677 Hs.283558 hypofoetical protein PR01 855 14.0 

132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 11.4 

132648 U51127 Hs.54434 hypothetical protein MGC1715 1.9 

35 132668 AB018319 Hs.5460 KIM0776 protein 2.6 

132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 2.0 

132715 F11875 Hs.5534 Homo sapiens cDNA FU12961 lis, clone NT2RP2005645 1.5 

132718 NMJXM600Hs.554 Sjogren syndrome antigen A2 (60kD, ribonucleoprotein autoantigen SS-A/Ro) 3.0 

132724 AI142265 Hs.55498 geranylgeranyl diphosphate synthase 1 2.4 

40 132731 AI189075 Hs.301872 hypothetical protein MGC4840 12.4 

132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 14.6 

132760 AA125985 Hs.56145 tfymosin, beta, identified in neuroblastoma cells 2.7 

132771 Y10275 Hs.56407 phosphoserine phosphatase 3.0 

132773 AA459713 Hs.295901 KIAA0493 protein 2.3 

45 132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 1.6 

132798 A1026701 Hs.5716 K1AA0310 gene product 3.7 

132807 U07418 Hs.57301 mul (E. cofi) homolog 1 (colon cancer, nonpoiyposis type 2) 1.8 

132810 AB007944 Hs.5737 K1AA0475 gene product 5.9 

132813 BE313625 Hs.57435 salute carrier family 11 (proton-coupled divalent metal Ion transporters), member 2 8.7 

50 132815 A1815189 Hs.57475 sex comb on midleg homolog 1 6.4 

132817 N27852 Hs.57553 tousteoMike kinase 2 3.6 

132821 AJ251595 Hs.169610 C044 antigen (homing function and Indian blood group system) 18 

132833 U78525 Hs.57783 eukaryotic translation initiation foctor 3. subunft 9 (eta, 1 1 6kD) 14.6 

132842 NM_016154Hs.279771 Homo sapiens dore PP1596 untavn mRNA 1.6 

55 132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2.5 

132851 U09716 Hs.287912 lectin, mannose-binding, 1 1,4 

132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 4.2 

132869 AW963217 Hs.203961 ESTs, Moderately similar to AF1 16721 89 PR02168 [H.sapiens] 2.8 

132873 AW007683 Hs.58598 K1AA1266 protein 2.0 

60 132875 NMJ)04850Hs.58617 Rho^ssodated, coited-coil containing protein kinase 2 1.6 

132891 BE267143 Hs.59271 U2(RNU2) smail nuclear RNA auxiliary factor 1 (non-standard symbol) 1.4 

132897 AW503667 Hs.59545 ring finger protein 15 5.4 

132902 A1936442 Hs.59838 hypothetical protein FU 10808 6.1 

132912 AW732760 Hs.167578 Homo sapiens cDNA FU1 1095 fis, done PIACE1005374 7.1 

65 132913 W78714 Hs.60257 Homo sapiens cDNAFU13598fc, clone PLACE1009921 2.8 

132940 T79136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds 6.1 

132941 A1817165 Hs.6120 hypothetical protein FU13222 10.3 
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132942 AA554458 Hs.197751 KIAA0666 protein 1.8 

132952 AI658580 Hs.61426 Homo sapiens mesenchymal stem cefl protein DSC96 mRNA, partial cds 2.2 

132952 AA576635 Hs.6153 CGWB protein 4.9 

132972 AA034365 Hs.288924 Homo sapiens cDNAFUl 1392 lis. done HEMBA1000575 2.7 

5 132973 AA035446 Hs.323277 ESTs 5.3 

132977 AA093322 Hs.301404 RNA binding motif protein 3 3.2 

132980 AA040896 Hs.62016 ESTs 13 

132994 AA1 12748 Hs.279905 clone HQ0310 PRO0310p1 3.0 . 

133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE: 335 1295, mRNA 10.3 

10 133015 AJ002744 Hs.246315 UDP-ftecety^alpha^abct^ 2.1 

133016 AI439688 Hs.6239 hypothefical protein RJ20886 U 

133053 AI065016 Hs.6390 Homo sapiens done RB3344 PRO0845 mRNA, complete cds 6.0 

133062 AW500374 Hs.64056 PRO01 49 protein 5.3 

133069 BE247441 Hs.6430 protein with poryglutarnlne repeat calcium (ra2^ homeostasis endoplasmic reticulum protein 4.9 

15 133091 AK001628 Hs.64691 K1AA0483 protein 3.5 

133110 AA808177 Hs.65228 ESTs 13.1 

133134 AF198620 Hs.65648 RNA binding motif protein 8A 1.3 

133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds 12 

133152 Z11695 Hs.324473 mitogervac^ated protein kinase 1 1.3 

20 133174 AA431620 Hs.324178 hypothetical protein MGC2745 17.1 

133175 AW955632 Hs.66666 ESTs, WeaWy similar to S19560 profine-rich protein MP4 - mouse (M.musculus] 1.8 

133177 X97795 Hs.66718 RAD54 (S.cerevislae)-fike 4.9 

133197 AI275243 Hs.180201 hypothetical protein RJ20671 3.1 

133208 AI801777 Hs.6774 ESTs 4.4 

25 133226 AW954569 Hs.296287 Homo sapiens, Simitar to bromcxJomaln-contalning 4, done IMAGE:3542455, mRNA, partial cds 1.7 

133228 AI492924 Hs.6831 go{gi phosphoproteln 1 6.0 

133240 AK001489 Hs.242894 ADP-ribosylatlon factor-fike 1 1.5 

133254 A1567421 Hs.273330 Homo sapiens, done IMAGE:3544662, mRNA, partial cds 1.4 

133266 A1160873 Hs.69233 zinc finger protein 5.6 

30 133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2JWMAN FORKHEAO BOX PROTEIN D2 [H.sapiens] 1.9 

133285 M76477 Hs389G82 GM2 gangGoside activator protein - 4.7 

133291 BE297855 Hs.69855 NRAS-reJated gene 5.0 

133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi 2.7 

133321 T79526 Hs.1 79516 integral type I protein 9.3 

35 133327 AL390127 Hs.7104 KruppeWike factor 13 4.4 

133347 BE257758 Hs.71475 add duster protein 33 1.8 

133360 AI016521 Hs.71816 v-akt murine Ihymoma viral oncogene homotogl 5.5 

133366 AA292611 Hs.72050 non-metastatic ceils 5, protein expressed in (nucleoside-diphosphate kinase) 2.7 

133367 AF231919 Hs.18759 KIAA0539 gene product 1.7 
40 133370 AF2455G5 Hs.72157 DKFZP564I1922 protein 1.8 

133383 8E313555 Hs.7252 KIAA1224 protein 1.7 

133390 AI950382 Hs.72660 phosphatldyberine receptor 1-3 

133391 AW103364 Hs.727 Inhlbin, beta A (actwln A. activin AB alpha polypeptide) 16.1 
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2 

45 133437 AL031591 Hs.7370 phosphotidyflnositol transfer protein, beta 10.4 

133452 NNL002759HS.274382 protein kinase, Interferon-indudble double stranded RNA dependent 1.2 

133453 AI659306 Hs.73828 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7 
133500 AW964804 Hs.74280 hypothetical protein FU22237 11.1 
133529 W45623 Hs.74571 ADP-rfoosylafion factor 1 2.8 

50 133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subunit, non-ATPase.2 2.9 

133543 AU077073 Hs.108327 damage-specific DNA binding protein 1 (127kD) 2,5 

133578 AU077050 Hs.75066 transOn 1.5 

133579 X75346 Hs.75074 mitogen-actrVated protein Wnase^'vated protein kinase 2 2,1 
133582 BE391579 Hs.75087 Fas-activated sertne/threonine kinase U 

55 133594 AW160781 Hs.172589 nuclear phosphoproteln similar to S. cerevfelae PWP1 Z2 

133595 AA393273 Hs.75133 transcription factor 6-tiko 1 (mitochondrial transcription factor 1-iike) 1,5 

133599 NM_002885Hs.75151 RAP1, GTPase activating protein 1 5.7 

133621 NNL0O4893HsJ5258 H2Ahfetonefarnir/,rnemberY 25.5 

133627 NM_002047Hs.75280 grycyMRNA synthetase 15.8 

60 133631 N1O00401HS.75334 exostoses (multiple) 2 3,3 

133649 U25849 Hs.75393 add phosphatase 1, soluble 1.6 

133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19 4.1 

133720 127641 Hs.75737 penwtrtolarrnatenall 1.5 

133722 AW969976 Hs^79009 matrix Gla protein 6.3 

65 133751 AW402048.HS.334787 Homo sapiens, Similar to likely orthoiog of yeast ARV1. clone IMAGE:3506392. mRNA 3.9 

133757 T52946 Hs.196209 RAE1 (RNA export 1 , S.pombe) hornofog 1.7 

133760 BE271766 Hs.181357 iaminin receptor 1 (67kO, ribosomal protein SA) 1.8 
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133765 M62194 Hs.75929 cadherin 11, type 2. OB-cadherfn (osteoblast) 1.5 

133780 AA557660 Hs.76152 decorin 3.5 

133784 BE622743 Hs.301064 affaptin 1 6.8 

133791 M34338 Hs.76244 spermidine synthase 2.6 

5 133797 AI133921 Hs.76272 retinoblastoma^lndlng protein 2 1.4 

133822 D50525 Hs.699 peptidylprolyl bomerase 8 (cydophilln 6) 8.0 

133842 AW797468 Hs.285013 putative human HLA class II associated protein I 13.5 

133845 AA147026 Hs.76704 ESTs 22 

133850 W29092 Hs.7678 ce&ilarretindc acid-binding protein 1 1.8 

10 133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 2.0 

133865 AB011155 Hs.170290 discs, large (Drosophia) homolog 5 2.8 

133867 AW340125 Hs76989 KIAA0097 gene product 6.7 

133868 AB012193 Hs.183874 culiin 4A 2.5 
133881 U30872 Hs.77204 centromere protein F (350/400kD, mttosin) • 3.0 

15 133922 U30825 Hs.77608 splicing factor, arginine/serine-rich 9 1.4 

133924 D86326 Hs.325948 vesicle docking protein p1 15 5.4 

133929 NM_006306Hs.211602 SMC1 (structural maintenance of chromosomes 1, yeast)-like 1 4.9 

133936 117128 Hs.77719 gamma-glutamyl carboxylase 3.7 

133941 BE244332 Hs,77770 adapbr^eteted protein complex 3. mu2subunil 12.1 

20 133959 X81789 Hs.77897 splicing factor 3a. subunit3,60l<D 9.7 

133976 AI908165 Hs.169946 GATA-binding protein 3 (T^cefl receptor gene activator) 3.1 

133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3 

133997 AI824113 Hs.78281 regulatorofG-proteins^nallhg12 " 9.7 

134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor 2.4 

25 134015 D31764 Hs.278569 sorting nexin 17 2.5 

134070 NMJX)3590Hs.78946 cullin3 1.3 

134110 U41060 Hs.79135 UV-1 protein, estrogen regulated 4.2 

134129 NMJ)14742Hs79305 KIM0255 gene product 2.2 

134134 H86504 Hs.173328 protein phosphatase 2, regulatory subunit B (B56), epsilon Isoform 5.0 

30 134200 BE559598 Hs.197803 KIAA01 60 protein 3,2 

134206 AF107463 Hs.79968 spfcing factor 30, sun/ival of motor neuron-related 2.5 

134208 NM_000288Hs.79993 peroxisomal biogenesis factor 7 2.1 

134219 NMJ004O2Hs.8O206 glucose-6-phosphate dehydrogenase 9.1 

134234 BE300078 Hs.80449 Homo sapiens, done 1MAGE:3535294, mRNA, partial cds 2.8 

35 134275 AI878910 Hs.3688 cisplatin resistance-assodated overexpressed protein 1.6 

134292 AI906291 Hs.81234 immunogloburin superfemBy, member 3 2.0 

134301 AW502505 Hs.81360 Homo sapiens cDNA: FU21927 fis, done HEP04178, highly similar to HSU90909 2.5 

134305 U61397 Hs.81424 ubiquiMxe 1 (sentrin) ^ 2.8 

134324 AB029023 Hs.179946 K1AA1 100 protein 10.4 

40 134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (verslcan) 1.9 

134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 2.6 

134337 NMJK>4922Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 2.3 

134348 AW291946 Hs.82065 interfeukin 6 signal transducer (gp130, oncostatin M receptor) 13.0 

134367 AA339449 Hs.82285 phosphoribosylgrydrcamide fom^^ 8.8 

45 134376 X06560 Hs.82396 2*,5*-ol^oadenylate synthetase 1 (4046 kD) 1.5 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 8.1 
134384 AI589941 Hs.8254 Homosaplens, Similar to torru>r differentially expressed Z6 

134391 AA417383 Hs.82582 Integrin. beta-like 1 (with EGF-flke repeat domains) 4.1 

134395 AA458539 Hs.8262 lysosomal 1.7 

50 134403 AA334551 Hs.82767 sperm specific antigen 2 2.6 

134405 AW067903 Hs.82772 collagen, type XI. alpha 1 1.3 

134411 BE272095 Hs.167791 retteulocalbln 1. EF-hand caldum bmding domain 3.2 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1.9 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 10.3 

55 134424 Z44190 Hs.83023 peroxisomal biogenesis factor 118 2.4 

134446 M112036 Hs.83419 KIAA0252 protein 1.2 

134447 M58603 Hs.83428 nudear factor of kappa light polypeptide gene enhancer In B-cells 1 (p105) 1.6 
134470 X54942 Hs.83758 CDC28 protein kinase 2 11 
134480 NM_005000Hs.83916 Empirically selected from AFFX single probeset 5.3 

60 134485 X82153 Hs.83942 cathepsin K (pycnodysostasis) 2.5 

134498 AW246273 Hs.84131 threonyMRNA synthetase 2.1 

134513 AA425473 Hs.84429 KIAA0971 protein 3.8 

134516 AKD01571 Hs.273357 hypothetical protein FU10709 2.4 

134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofector 4 67 

65 134529 AW41 1479 Hs.848 FK506-bindlng protein 4 (59kD) 2.3 

134577 BE244323 Hs*5951 exportin, tRNA (nudear export receptor fbrtRNAs) 5.5 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 
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134612 AW068223 Hs.171581 ublquitin C-terminal hydrolase UCH37 . 22 

134624 AF035119 Hs.8700 deleted In fiver cancer 1 2.0 

134632 X78520 Hs.174139 chloride channel 3 2.3 

134654 AK001741 Hs.8739 hypothetical protein FU 10879 1.4 

5 134664 AA256106 Hs.87507 ESTs ■ 72.9 

134666 8E391929 Hs.8752 transmembrane protein 4 8.5 

134687 U62317 Hs.88251 arytsulfatase A 6.0 

134692 NM.003474Hs.8850 a disintegrin and metaDoprotstnase domain 12 {meltrin alpha) 4.3 

134705 BE161887 Hs.88799 anaphase-promoting complex subunit 10 2.3 

10 134714 Y14768 Hs.890 lysosomal 6.7 

134719 AA852985 Hs.89232 chromobox homolog 5 (Orosophfla HP1 alpha) 2.3 

134722 AF129536 Hs^84226 F«box only protein 6 2.9 

134724 AF045239 Hs.321576 ring finger protein 22 6.6 
134746 X07871 Hs.89476 C02 antigen (p50), sheep red blood cell receptor ' 2.3 

15 134751 AW630803 Hs.89497 laminBI 62 

134790 BE002798 Hs.287850 Integral membrane protein 1 1.9 

134806 AD001528 Hs.69718 spermine synthase 18 

134834 AW451370 Hs.6991 adaptor-related protein complex 1, gamma 2 subunit 1.4 

134850 AI701162 Hs.90207 hypothetical protein MGC1 1138 1.4 

20 134853 BE268326 Hs.90280 5-aminoimklazoM^rboxa/nlde ribonucleotide forniyttransferase/lMP cydohydiolase 5.6 

134859 026488 Ks.90315 KIAA0007 protein 2.8 

134880 AI879195 Hs.90606 1 5 kDa selenoprotein 1.7 

134910 AA532963 Hs.9100 Homo sapiens cDNARJ 13100 fis, clone NT2RP3002255 1.7 

134925 AW885909 Hs.6975 PRO1073 protein 2.1 

25 134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform U 

134971 A1097346 Hs.286049 phosphoserine aminotransferase 2.1 

134975 R50333 Hs.92186 teman colled-coD protein 2.3 

135011 AB037835 Hs.92991 K1AA1414 protein 1.6 

135022 NM_000408Hs.93201 grycerol^phosphate dehydrogenase 2 (rnKochondrial) 3.9 

30 135032 AW301984 Hs.173685 hypomefical protein FU 1261 9 62 

135077 AW503733 Hs.9414 K1AA1488 protein 2.0 

135083 AB036063 Hs.94262 p53-indudble ribonucleotide reductase small subunit 2 homolog 1.3 

135095 AF027219 Hs.9443 zinc finger protein 202 7.1 

135096 AA081258 Hs.132390 zinc linger protein 36 (KOX 18) 3.2 
35 135153 AI093155 Hs.95420 JM27proteh 2.5 

135181 BE250865 Hs.279529 px194ixe protein 1.4 

135199 AA477514 Hs.96247 transfiiwssociated factor X 5.0 

135207 N26427 Hs.9634 ESTs, Highly similar to C10JUJMAN PUTATIVE C10 PROTEIN [H.sapiens] 6.1 

135214 T78802 Hs.96560 hypothelical protein FU 11 656 4.6 

40 135243 BE463721 Hs.97101 putative G protein-coupled receptor 5.6 

135245 AJ028767 Hs.262603 ESTs 3.5 

135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-ilnked retinopathy protein (H.sapiens J 1.2 

135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 2.6 

135274 AA448460 Hs.1 12017 GE36gene 5.3 

45 135294 AA150320 Hs.9800 protein kinase Njmu-R1 9.1 

135295 AI090838 Hs.98006 ESTs 2.4 

135307 AI743770 Hs.98368 ESTs, Weakly similar to K1AA0822 protein [H.sapiens] 13.3 

135321 A1652069 Hs.98614 ribosome binding protein 1 (dog 180kD homolog) 2.6 

135354 AA456454 Hs.183418 ceO division cycle 2-Dke 1 (PITSLRE proteins) 8.3 

50 135361 AA373452 Hs.167700 Homo sapiens cDNA FU10174 fc, clone HEMBA1003959 1.5 

135389 U05237 Hs.99872 fetal Alzheimer anfigen 4.9 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone recepton tesficular femhlzatkw; spinal and bulbar 2.0 

134975 R50333 Hs.92186 Leman colled-coll protein 2.6 

. 135011 AB037835 Hs.92991 KIAA1414 protein 1.4 

55 135022 NM_O0O4O8Hs.9320t gtyceroW^phate dehydrogenase 2 (mi 1.6 

135032 AW301984 Hs.173685 hypothetical protain FU12619 1.4 

135077 AW503733 Hs.9414 KIAA1488 protein 1.8 

135083 AB036063 Hs.94262 p53-lnducibie ribonucleotide reductase s 2.5 

135095 AF027219 Hs.9443 zinc finger protein 202 1.5 

60 135096 AA081258 Hs,132390 zinc finger protein 36 (KOX 18) 2.1 

135153 AI093155 Hs.95420 JM27 protein 4.4 

135181 BE250865 Hs.279529 px19-Dke protein 14.9 

135199 AA477514 Hs.96247 transGrMBsocated factor X 1.3 

135207 N26427 Hs.9634 ESTs^H^sImllartoCIOJIUMANPUTATl 1.7 

65 135214 T78802 Hs.96560 hypofteflcal protein FU1 1656 6.1 

135243 BE463721 Hs£7101 putaflve G protein-coupJed receptor 2.7 

135245 AI028767 Hs.262603 ESTs ' 12.2 
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135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked 7.6 

135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 1.8 

135274 AA448460 Hs.1 12017 GE36gene 4.1 

135294 AA150320 Hs.9800 protein kinase tymitRI U 

5 135295 A1090838 Hs.98006 ESTs 4.8 

135307 AI743770 Hs.98368 ESTs, Weakly similar to WAA0822 protein 5.8 

135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3 

135354 AA456454 Hs.183418 ceD division cycle 2-fike 1 (PITSLRE pr 5.7 

135361 AA373452 Hs.167700 Homo sapiens cDNA FU10174 fis, clone HE 7.9 

10 135389 U05237 Hs,99872 fetal Alzheimer antigen 1,9 

135400 X78592 Hs.99915 androgen receptor (dlhydrotestosterone r 13.9 

302256 AA857131 Hs.171595 HIV TAT specific factor 1 5.3 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 2.2 

303135 AW592789 Hs.279474 HSPC070 protein 1.4 

15 303686 AK000714 Hs.109441 MSTP033 protein 5.2 

310085 R43191 Hs.101248 Homo sapiens done IMAGE32553, mRNA seq 2.3 

315518 AA808229 Hs.167771 ESTs 2.8 

317781 NM_007057Hs.42650 ZW10 interactor 2.0 

320836 AJ268997 Hs.197289 rab3 GTPase-activatmg protein, norwata 5.5 

20 321114 AA902256 Hs.78979 Golgi apparatus protein 1 1.4 

322221 N24236 Hs.179662 nucleosome assembly protein 1-Iike 1 1.3 

322474 AF1 18083 Hs.29494 PR01912 protein 2.9 

322556 BE041451 Hs.177507 hypothetical protein 1.6 

323541 AF292100 Hs.104613 RP42homotog 1.8 

25 407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo 1.6 

408196 AL034548 Hs.43627 SHY (sex determining region Y)-box 22 6.1 

408813 AI58OO90 Hs.48295 RNA helicase family 5.6 

409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti 2.6 

413670 AB000115 HsJ5470 hypoflieflcaJ protein, expressed in osteo 2.4 

30 414108 AI267592 Hs.75761 SFRS protein kinase 1 1,5 

414846 AW304454 Hs.77495 UBX domain-containing 1 4.2 

416980 AA381133 Hs.80684 high-mobility group (nonhistone chromoso 23.6 

417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate 5.8 

418283 S79895 Hs.83942 cathepsin K (pycnodysostosis) 1.3 

35 418467 NM.006910HS.85273 retinoblastoma4)5nding protein 6 1.6 

420269 U72937 Hs.96264 alpha thatassemia/mental retardation syn 2.3 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 1.6 

421225 AA463798 Hs.102696 MCT-1 protein 3.5 

421642 AF172066 Hs.106346 retlnolc add repressible protein 4.9 

40 421828 AW891965 Hs.279789 histone deacetytase 3 3.1 

421983 AI252640 Hs.110364 peptidylprolylisomeraseC{cyclophilin 1.9 

422052 AA302744 Hs.104518 ESTs 2.4 

422055 NMJ)14320Hs.111029 putative heme-bindlng protein 4.1 

423750 AF165883 Hs.298229 prefoldin2 7.0 

45 424001 W67883 Hs.137476 paternally expressed 10 (PEG10; WAA105 4.9 

425182 AF041259 Hs.155040 zinc finger protein 217 3.4 

425284 AF155568 Hs.155489 NS1 -associated protein 1 2.1 

426372 BE304680 Hs.169531 DEADJH (Asp-GluAIa-Asp/His) box polypep 7.5 

428049 AW183765 Hs.182238 GW128 protein 1.7 

50 428477 AW500533 Hs.1 1482 splicing factor, arginine/sennewich 11 2.4 

437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 3.8 

438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcinomas 5.6 

441560 F13386 Hs.7888 Homo sapiens done 23736 mRNA sequence 2.0 

445580 AF167572 Hs.12912 skbl (S. pombe) homolog 7.5 

55 446999 AA151520 Hs.334822 hypothetical protein MGC4485 2.2 

447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal) £8 

447778 BE620592 Hs.71190 ESTs, Weakly similar to S1 6506 hypotheti 1.7 

448873 NMJJ03677HS.22393 density-regulated protein 5.9 

449687 W68520 Hs.331328 intermediate filament protein syncoilin 5.6 

60 450701 H39960 Hs.288467 Homo sapiens cDNA FU12280 fis, done MA 1.4 

450703 AA011202 Hs.184771 nudear factor 1/C (CCAAT-binding transc 4.7 

452461 N78223 Hs.108106 transcription factor 29 

452511 BE408178 Hs.285165 Homo sapiens cDNA FU20845 fis, done AD 12.1 

453157 AF077036 Hs.31989 DKFZP586G1722 protein 4.7 

65 453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp 1.3 

100833 AF135168 Hs.108802 N^thylrnaieirnide-sensitive fadtor 3.2 

102481 U50360 gb:Humancaldum,calmodurjn-dependentp 6.2 
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102827 BE244588 Hs.6456 dwperontaconlainlagTCP1 t subunlt2(b 7.9 

103549 BE270465 Hs.78793 protein kinase C.zeta 2.0 

104331 AB040450 Hs.279862 cdklnhIbftorp21 binding protein 5.3 

110018 AW579842 his. 104557 hyrjometicaJ protein FU1Q697 2.0 

5 115008 AKD01827 Hs.87889 heOcas*rnoi 5.7 

119075 M10905 Hs.287820 fibronectin 1 13 

119615 AL034423 Hs.75875 ublqur&vconjugaflng enzyme E2 variant 2.9 

125006 BE065136 Hs.145696 spDdng factor (CCU) 1.7 

127609 X80O31 Ks.530 collagen, type IV, alpha 3 (Goodpasture 2.4 

10 129209 R62G76 Hs. 17820 Rrunissociated, co3ed-coB containing p 5.2 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5 

130182 BE267033 Hs.192853 ubtquitin-conjugating enzyme E2G 2 (homo 11.0 

130365 W56119 Hs.155103 eukaryotic translation initiation factor 3.3 

131135 NrvL016569Hs.267182 TBX3-lso protein U 

15 131853 AI681917 Hs.3321 ESTs, Highly similar to (RX1_HUIWN IROQU 3.2 

131881 AW361018 Hs.3383 upstream regulatory element binding prot 14.3 

132726 N52298 Hs.55608 hypothetical protein MGC955 3.0 

135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 2.7 

409487 H19886 gb:yn57a05.r1 Soares adult brain N2b5H 2.3 

20 416040 AW819158 Hs.289044 Homo sapiens cDNA FLJ12048 fis. clone HE 7.4 
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TABLE 4A 



Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123615 30686.-15 

123619 371681 1 

101445 1650_-5 

124385 656394 J 

124417 1642364J 

124482 1657509J 

102481 31281„-28 

103349 11052.-2 

110856 19346J4 . 



120280 160212.1 
113248 328626J 
120472 44573.2 



129019 44573J 



120695 9683_3 



122188 275673J 
121581 283769J 
122618 305217J 
109026 150431J AA157811 AA836869 
123658 genbanLAA609364 AA609364 
123811 genbank^AA620586 AA620586 
125115 genbank_T97341 T97341 
125147 NOTJ=OUNDj>ntre*J/V38150 
118737 382979J AA1 99686 N73861 
120274 genbankJVA177051 AA177051 
113196 genbanleT57317 T57317 
120504 genbanLAA256837 AA256837 
120639 genbanU\A286942 AA286942 



AA609170 

AA602964AA609200 
M21259 

AI267847 N27351 
N34059N46979 
N53935 N53950 
U50360 
X89059 

AA992380 N33063 N21418 H79958 R21911 H79957 

103797 109699J AA080912M075318 M083403 AA076594 M078992 AA084926 AA081881 AA113913 AA113892 
AA083821 AA134801 AA082953 AA070343 AA062835 AA075419 AA063293 AAQ71252 M078900 AA062836 AW974305 

M190577M181657 

T63857 AW971220 AA493469 T63699 

AJ950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AJ251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456T82391 BE328571 T75102 R34725 AA884922 BE328517 
A1219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 A1872193 AI763273 AW173586 AW150329 AI653832 
AI762688 AA988777 AA488892 AJ356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
8E612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874 
AI819225 AW2Q5862 AI683338 A1858509 AW276905 AJ633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AJ002839 R67840 AA300207 AW959581 T63226 F04005 
A1950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
A1219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 A1689062 AA282915 AW102898 AJ872193 AI763273 AW173586 AW150329 AJ653832 
A1762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW981101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 A1022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
AA976503 A1917802AA953664M404613 AA42B771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210 
AA970201 AI633384 AA425910 AI017004 A1241295 AA402816 AA291468 
AA398838AA435847 

AA416568 AA442889 AA417233 AA442223 
AA453641 AA454061 
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120809 genbankJ\A346495 AA346495 
113702 genbank^T97307 T97307 

129680 23162J U03749 NM.001275 JQ3483 JQ3915 A1214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122 
AI858999 055958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 055313 AW512244 AA846441 
5 AW043898AI969102AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 

AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 AI686818 054392 AI022485 AA431410 AA854232 W39212 W15214 AA894441 
AI803081 AJ167381 AW245389AA319430 AA3351 56 AI042646 AA327030 AA725 170 727943 AA889304 AA976699 AI687001 
AJ621 107 A1865540 AA772107 C08286 AA319661 AA405992 



101045 


entre2L.J05614J05614 


117247 


genbank 


N21032 


110501 


genbank 


H55748 


103392 


entre2LX94563X94563 


105032 


genbank 


AA1 27818 


119513 


NOT_FOUND__entrez W37933 


105445 


genbank 


AA252395 


.121514 


genbank 


AA412112 


121558 


genbank 


AA412497 


121911 


genbank 


AA427950 


123315 


714071J 


AA496369AA498646 


114911 


genbank 


AA236672 


409487 


1134778J 


H19886AW402806T10231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Ptey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

10 Unigene Tide: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle 



15 

100114 X02308 Hs.82962 
100147 D13666 Hs.136348 
100154 H60720 Hs.81892 
100335 AW247529 Hs.6793 
20 100666 L05424 Hs.169610 

100667 L05424 Hs.169610 

100668 L05424 Hs.169610 
100678 AW502935 Hs.740 
100988 AK000405 Hs.76480 

25 101031 J05070 Hs.151738 
101045 JQ5614 
101332 J04088 Hs.156346 
101352 AI494299 Hs.16297 
101580 NWL012151HS.83363 

30 101592 AF064853 Hs.91299 
101767 M81057 Hs.180884 
101606 AA566894 Hs.1 12408 
101810 NWL000318Hs.180612 
101983 AJ904232 Hs.75323 

35 102107 BE258602 Hs.182366 
102165 BE313280 Hs.159627 
102198 AW950852 HsJ4598 
102217 AA829978 Hs.301613 
102220 U24389 Hs.65436 

40 102302 M306342 Hs.69171 
102348 U37519 Hs.87539 
102374 U33635 Hs.90572 
102455 U48705 Hs.75562 
102568 W81489 Hs.223025 

45 102618 AL037672 HS.81Q71 
102687 NM.007019HS.93002 
102689 U96132 Hs.171280 

102704 AU077058 Hs.54089 

102705 T97490 Hs.50002 
50 102801 BE252241 Hs.38041 

102827 BE244588 Hs.6456 
103060 NMJ)05940Hs.155324 
103080 AU077231 Hs.82932 
103178 M205475 Hs.275865 

55 103206 X72755 Hs.77367 
103238 AI369285 Hs.75189 
103547 AI376722 Hs.180062 
103549 BE270465 Hs.78793 
103886 AKD01278 Hs.105737 

60 104325 BE379766 Hs.150675 
104827 AW052006 Hs.8551 
104846 A1250789 Hs.32478 
104854 AA041276 Hs.154729 
104867 AA278898 Hs.225979 



R1 



thymidylate synthetase 2.9 

osteoblast specific factor 2 (fasciciin 7.5 

KIAA0101 gene product 9.2 

platelet-activating factor acetylhydroia 2.7 

C044 antigen {homing function and Indian 5.7 

CD44 antigen (homing function and Indian 9 

CD44 antigen (homing function and Indian 7.6 

PTK2 protein tyrosine kinase 2 53.2 

ubiquiMke4 11.4 

matrix metalfoproteinase 9 (getatinase B 8.2 

gb:Human proliferating cell nuclear anfi 5 

topoisomerase (DNA) II alpha (170kO) 3.4 

COX17 (yeast) homotog, cytochrome c oxid 6.3 

coagulation (actor Vlli-associated (intr 5.7 

guanine nucleotide binding protein ( 5.6 

carboxypeptidaseBI (tissue) 14.4 

S100 calcium-binding protein A7 (psorias 8.9 

peroxisomal membrane protein 3 (35kD, Ze 3.2 

prohibltin 8.4 

heat shock protein 75 1.4 

death associated protein 3 4.6 

polymerase (DNA directed), delta 2, regu 4.3 

JTV1 gene 6.7 

lysosomal 4.3 

protein kinase C-like 2 2.7 

aldehyde dehydrogenase 3 family, member 2 

PTK7 protein tyrosine kinase 7 6.2 

discoldin domain receptor family, member 6.9 

RAB31, member RAS oncogene family 5.3 

extracellular matrix protein 1 5.8 

ubiquitin carrier protein E2-C 4.3 

hydroxyacyl-Coenzyme A dehydrogenase, ly 6 

BRCA1 associated RING domain 1 1.9 

small inducible cytokine subfamily A (Cy 2.3 

pyridoxal (pyridoxin^ vitamin B6) klnas 6.4 

chaperonin containing TCP1, subunit 2 (b 5.6 

matrix metalloproteinase 1 1 (MMP 1 1 ; stro 4.5 

cyciin D1 (PRAD1: parathyroid adenomatos 3.1 

ribosomal protein S1 8 9.9 

monokine induced by gamma interferon 8.8 

death-associated protein 5.6 

proteasome (prosome, macropain) subunit, 9.7 

protein kinase C.zeta 7.9 

hypothetical protein FU10416 similar to 6.5 

polymerase (RNA) II (DMA directed) polyp 6.3 

PRP4/3TKWD splicing factor 10.9 

ESTs 5.6 

3-phosphoinositide dependent protein kin 123 

hypothetical protein similar to small G 2 
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104896 AW015318 Hs.23165 ESTs 17.7 

104909 AW408164 Hs.249184 transcripflon factor 19 (SC1) 5 

104916 AW958157 Hs.155489 NSI-assodated protein 1 1.7 

104919 AA026880 Hs.25252 prolactin receptor 1.4 

5 104974 Y12059 HS.278S75 bronwo^mah-contalnlng 4 1.4 

104978 A1199268 Hs. 19322 Homo sapiens, Similar to RIKEN cDNA 2010 72 

105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3 

105039 AA907305 Hs.36475 ESTs 2.5 

105079 AA151342 Hs.12677 CG1-147 protein 9.5 

10 105088 H58589 Hs.35156 Homo sapiens CDNAFU1 1027 fc, clone PL 22 

105393 AF167570 Hs.256583 InterleuJdn enhancer binding factor 3, 9 5.4 

105547 AA262640 Hs.27445 unknown 9.3 

105564 BE616694 Hs.288042 hypothetical protein RJ14299 14 

105658 AA985190 Hs.246875 hypothetical protein FU20059 9.4 

15 105746 AW151952 Hs.46679 hypothetical protein FU20739 15 

105858 AF151066 Hs.281428 hypothetical protein 2.9 

105930 AF016371 Hs.9880 peptidyl prolyl Isomefase H (cyctophflin 5.2 

106094 AA533491 Hs.23317 hypothetical protein RJ 14681 6.8 

106350 AKD01404 Hs.194698 cyclinB2 5.7 

20 106359 AW390282 Hs.31130 transmentrane7superfemnymember2 6.3 

106610 AA458882 Hs.79732 fibuBnl 7.9 

106624 NM.003595HS.26350 tyrosylprotein sulfotransferase 2 7.7 

106713 BE614802 Hs.184352 hypothetical protein FU12549 4.5 

106829 AW959893 Hs.27099 hypothetical protein FU23293 similar to 16.2 

25 106846 AB037744 Hs.34892 KIAA1 323 protein 2.2 

106873 N49809 Hs.11197 Homo sapiens, clone 1MAGE:3343149, mRNA, 16.8 

106973 BE156256 Hs.11923 hypothetical proteai 6.6 

107029 AF264750 Hs.288971 myeloioTryrnr^oidorrnixeoMineageleukem 18 

107197 W15477 Hs.64639 glioma pathogenesis^elated protein 6.1 

30 107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.4 

107901 L42612 Hs.335952 keratin 68 2.5 

107922 BE153855 Hs.61460 ig superfamfly receptor LNIR 2.2 

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, Isoenzyme 6.7 

108274 AF129535 Hs.272027 F-box only protein 5 7.1 

35 108647 BE546947 Hs.44276 homeoboxCIO 9.8 

108695 AB029000 Hs.70823 WAA1077 protein 72 

108894 AK001431 Hs.5105 hypothetical protein FU10569 4 

109011 AA156542 Hs.72127 ESTs 14 

109068 AA164293 Hs.72545 ESTs 2.9 

40 i 109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9 

109468 NM.015310Hs.6763 KIAA0942 protein 3.2 

110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 42 

110330 AI288666 Hs.16621 DKF2P434I1 16 protein 6.2 

110501 H55748 gbyq94a0ls1 Soares fetal liver spleen 6.1 

45 110742 AW190338 Hs.28029 hypothetical protein MGC1 1256 7.6 

110762 BE044245 Hs.30011 hypothetical protein MGC2963 9.3 

110856 AA992380 gbx)t37g08.s1 SoaresJestlsJWT Homo sap 2.3 

110958 NM_005864Hs.24587 signal transduction protein (SH3 contain 6.7 

111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMANZ 3.6 

50 111179 AK000136 Hs.10760 asportn (LRR class 1) 7.1 

111239 N90956 Hs.17230 hypothetical protein FU22087 75 

111285 AA778711 Hs.4310 eukaryoflc translation initiation factor 6.9 

111392 W46342 Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA, 8.4 

111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (fr 10.6 

55 112244 AB029QQ0 Hs.70823 , KIAA1 077 protein 14.6 

112995 AA737033 Hs.7155 ESTs, Moderately similar to 211 5357ATYK 5.6 

113777 BE266947 Hs.10590 zinc finger protein 313 13.4 

113791 A1269096 Hs.135578 chitobiase, di-N-acetyl- 13 

113811 BE207480 Hs.6994 Homo sapiens cDNA: FU22044 fis, done H 3.1 

60 113834 T26483 Hs.6059 EGF-containing GbuMke extraceliula 1U 

113868 W57902 Hs.90744 proteasome (prosome, macropaln) 26S subu 2.7 

113870 AL079314 Hs.16537 hypothetical protein, similar to (U06944 6.1 

113923 AW953484 Hs.3849 hypomeffcal protein RJ22041 similar to 1.9 

114275 AW515443 Hs.306117 WAA0306 protein 15.8 

65 114895 AA236177 Hs.76591 WAA0887 protein 7.1 

114965 AI733881 Hs.72472 BMP-R1B 2.3 

115081 AI751438 Hs.41271 Homo sapiens mRNA full length Insert cDN 11.8 
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115278 AK002163 
115291 BE545072 
115652 BE093589 
115693 AF231023 
115941 AI867451 
115968 AB037753 
116011 AL359053 
116417 AW499664 
116470 AI272141 
116637 AK001043 
117132 AI393666 
117881 AF161470 
118528 AI949952 
119075 M10905 
119265 BE539706 
119349 T65004 
119403 AL1 17554 
119789 BE393948 
120205 H26735 
120253 AA131376 
120297 AA191384 
120325 AA195651 
120327 AKD00292 
120349 AW969481 
120356 AF000545 
120371 AA219305 
120383 AL109963 
120386 AW969665 
120389 AW967985 
120396 AA134006 
120418 AW966893 
120472 AI950087 
120484 AA253170 
120570 AA280679 
120582 BE244830 
120596 AA282074 
120624 AW407987 
120695 AA976503 
120713 AW449855 
120750 A1191410 
120774 A1608909 
120807 AA346385 
120809 AA346495 
120984 BE262951 
121081 AA398721 
121408 AA406137 
121505 AA494172 
121508 AA402515 
121513 AA416653 
121549 AA412477 
121558 AA412497 
121655 AA421537 
121744 AA398784 
121748 BE536911 
121773 AB033022 
121832 AW340797 
121839 AA425691 
121882 AA426376 
121911 AA427950 
121999 M430211 
122013 AA431085 
122036 W92142 
122356 AA443794 

122371 AA868555 

122372 AA446008 
122460 AW418788 
122490 M448349 



Hs.301724 

Hs.122579 

Hs.38178 

Hs.55173 

Hs.46679 

Hs.62767 

Hs.57664 

Hs.12484 

Hs.83484 

Hs.92033 

Hs.42315 

.Hs.260622 

Hs.49397 

Hs.287820 

Hs.285363 

Hs.163561 

Hs.1 19908 

Hs.50915 

Hs.91668 

Hs.326401 

Hs.104072 

Hs.104106 

Hs.278732 

Hs.55189 

Hs.296433 

Hs.104196 

Hs.123122 

Hs.154848 

Hs.325572 

Hs.79306 

Hs.26613 

Hs.96473 

Hs.271445 

Hs.284228 

Hs.237323 

Hs.173518 

Hs.96557 
Hs.96693 
HS.193985 
Hs.30002 

Hs.99052 

Hs.186749 

Hs.98019 

Hs.194417 

Hs.97887 

Hs.181510 

Hs.98142 

Hs.178072 

Hs.97514 

Hs.234545 

Hs.158654 

Hs.98434 

Hs.191606 

Hs.98459 



Hs.98706 

Hs.271963 

Hs.98390 

Hs.178222 

Hs.336677 

Hs.99148 

Hs.238151 



hypothetical protein RJ 1 1 301 1.5 
hypothetical protein FLJ10461 6.2 
hypothetical protein FU23468 10.6 
cadherin, EGF LAG seven-pass G-type rece 6.8 
hypothetical protein FU20739 5.5 
WAA1332 protein 9.8 
Homo sapiens mRNA full length insert cDN 2.4 
Human done 23826 mRNA sequence 7.4 
SRY (sex determining region Y}-box 4 2.1 
Integrin-llnked kinase-associaled serine 2.7 
p10-bindlng protein 5.2 
butyrate-lnduced transcript 1 57 
ESTs 7.4 
fibronecrJn 1 5.7 
ESTs 1.4 
ESTs 8.4 
nudeolarprotein NOP5/NOP58 67 
kallikreln 5 (KLK5; KLK-L2; stratum com 9.2 
Homo sapiens clone PP1498 unknown mRNA 45.7 
fibroblast growth factor 12B 38.9 
ESTs,WeaWys1milartoZ195JWMANZINC 15.2 
ESTs 6.4 
hypothetical protein RJ20285 16.1 
hypothetical protein 163 
putative punnerglc receptor 28.1 
EST 12.4 
FSH primal response (LRPR1, rat) homoto 97 
hypothetical protein DKFZp434D0127 32.6 
ESTs, Moderately similar to ALU7_HUMAN A 21.7 
eukary otic translation initiation factor 1 2.5 
Homo sapiens mRNA; cDNA DKFZp586F1323 (f 11.4 
gb.T/vqrj5c02j(1NCLCGAPJ<id12Homosapien 19.4 
EST 10.4 
ESTs, Weakly similar to ALU1JWMAN ALU S 14.4 
ZNF135«llke protein 10.2 
N-acetylglucosamlne-phosphate mutase 7.5 
KH)hase phosphoprotsln homotog 52 
gb:oq30a04.s1 NCLCGAPJ3C4 Homo sapiens 46.8 
Homo sapiens cDNA FU12727 fis, done NT 5.9 
ESTs, Moderately similar to 2109260A Be 7 
ESTs 7.8 
SH3-containing protein SH3GLB2; K1AA1848 6.8 
gb:EST52657 Fetal heart li Homo sapiens 4.4 
ESTs 5.6 
ESTs, Highly similar to I37550 mismatch 5.4 
EST 6 
ESTs 13.1 
ESTs 28 
ESTs 6.2 
EST 7.4 
gb:zt95g12^1Soares_testis_.NHT Homo sap 2.8 
Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8 
ESTs 7,1 
hypothetical protein NUF2R 19.5 
KIAA1196 protein 7.9 
ESTs 5.8 
ESTs, Highly similar to KIAA1046 protein 5 
ESTs 5 
gb:zw50ID2£l Soares totaUetus_Nb2HF8_ 7.2 
EST 6.4 
ESTs 6.5 
ESTs, Weakly slmflar to ALU5.HUMAN ALUS 13.1 
ESTs 7.3 
ESTs 5 
EST 7.6 
ESTs,WeaWyslmiiartoS43569R01H10.6 9.7 
EST 6.1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



122492 
122510 
122530 
122572 
122607 
122614 
122616 
122618 
122622 
122717 
122829 
122838 
122856 
122868 
122907 
123016 
123034 
123136 
123152 
123394 
123466 
123486 
123615 
123735 
123753 
124006 
124385 
124440 
124656 
124683 
124735 
124761 
124768 
124788 
124811 
124812 
124822 
124860 
124903 
124930 
124942 
125051 
125056 
125101 
125115 
125280 
127274 
128528 
128670 
128691 
128772 
128781 
128797 
128868 
128891 
128946 
128975 
128995 
129019 
129076 
129088 
129096 
129198 
129347 
129362 
129372 
129404 



AA448417 Hs.104990 
AA449232 Hs.99195 
AW959741 Hs.40368 
AA452601 Hs.99287 
AA453518 Hs.98023 
AA453630 Hs.99339 
AA453638 Hs.161873 
AA453641 

AA453987 Hs.144802 
AA456859 Hs.178358 
AW2O4530 Hs.99500 
AA460584 Hs.334386 
A1929374 Hs.75367 
AF005216 Hs,1 15541 
AA470074 Hs.169896 
AW338067 Hs.323231 
AL359571 Hs.44054 
AW451999 Hs.194024 
AW601773 Hs.270259 
AA731404 Hs.105510 
AA599042 Hs.112503 
BE019072 Hs.334802 
AA609170 

NM_013241Hs.95231 
AA609955 Hs.234951 
AI147155 Hs.270016 
AI267847 

AA532519 Hs.129043 
AW297702 Hs.102915 
AA381661 Hs.1 19878 
R22952 Hs.268685 
AA374756 Hs.93560 
AW368528 Hs.100855 
R43543 Hs.100912 
R46068 H3.288912 
R47948 Hs.188732 
AA418160 Hs.86043 
R65763 Hs.101477 
AW298713 Hs.221441 
AI076343 Hs.173939 



R99978 

T79956 

T81310 

AJ472068 

T97341 

AI123705 



Hs.268892 
Hs.100588 
Hs.100592 
Hs.286236 

Hs.106932 
AW966158 Hs.58582 
R39234 Hs.251699 
AA975486 Hs. 103441 
W27939 Hs,103834 
BE302796 Hs.105097 
N71826 Hs.105465 
NfyL002975Hs.105927 
AA419008 Hs.106730 
F34856 Hs.292457 
Y13153 Hs.107318 
BE560779 Hs.284233 
AI816224 Hs.107747 
A1950087 

AW296806 Hs.326234 
AA744610 Hs.194431 
AA463189 Hs.288906 
N57532 Hs.109315 
BE614192 Hs.279869 
U30246 Ha.110736 
NM_016039Hs.110803 
A1267700 Hs.317584 



ESTs 5.4 
ESTs 112 
adaptor-related protein complex 1 , sigma 10.1 
EST 11 
ESTs 61.5 
EST 10.7 
ESTs 107.3 
gb2x48e06.s1 Soares_test's_NHT Homo sap 31.1 
ESTs 5.6 
ESTs 8.5 
ESTs 81.8 
ESTs 75.3 
Src-like-adapier 5.8 
Janus kinase 2 (a protein tyrosine Unas 5.3 
ESTs 11.5 
Homo sapiens cDNAHJ1 1946 fis, done HE 2,8 
ninein (GSK3B interacting protein) 8.7 
ESTs 5,1 
ESTs 5.2 
ESTs 3.6 
EST 7,4 
Homo sapiens cDNA FU 14680 fis, cione NT 2.4 
gbaf12a12^1 Soares_testis_NHT Homo sap 7.8 
FHVFH2 domain-containing protein 10 
Huntingtin interacting protein E 30.6 
ESTs 8.1 
gbaq49a10.x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from clone 989H1 1 on 7.8 
ESTs 8.3 
ESTs ( Weakly similar to M3K9.HUMAN MITOG 7.9 
ESTs 11.3 
Homo sapiens mRNA for K1AA1771 protein, 9 
ESTs 8.1 
Homo sapiens cDNA: FU22726 fis, done H 5.1 
hypothetical protein FU22604 142 
ESTs 73 
Homo sapiens cDNA FU13558 fis, clone PL 6.6 
EST 23.9 
ESTs 32.4 
ESTs, Weakly similar to ALUB_HUMAN nil 22.8 
ESTs, Moderately similar to B34087 hypot 6.1 
EST 135.3 
ESTs 5.4 
K1AA1856 protein 5.6 
gb:ye57e05.s1 Soares fetal liver spleen 9.6 
ESTs 8 
Homo sapiens cDNA FU 12789 fis, done NT 12.8 
ESTs, WeaWy similar to IDN4-GGTR14 (H.s 2.8 
Homo sapiens, Similar to RIKEN cDNA 1 700 7.1 
hypothetical protein MGC5576 7.7 
thymidine kinase 1, soluble 5.3 
small nuclear ribonudeoproteln polypept -53.9 
stem cell growth factor; lymphocyte seer 13.3 
chromosome 22 open reading frame 3 3 
Homo sapiens, done MGC: 16362, mRNA, com 13.3 
kyninBnine3wrionooxygenase(kynurenine3 72 
NICE-5 protein 14 
DKFZP566C243 protein 1.9 
gb:wq05c02jc1 NC!_CGAPJ<ld12 Homo saplen 2.9 
ESTs, Highly similar to T46422 hypothec 5 
paDadin 17.1 
WW Domain-Containing Gene 205 
WAA1415 protein 5.8 
melanoma-associated anligen recognised b 7.6 

6.7 
2 
5 



CGI-99 protein 
ESTs 
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129482 AA188185 Hs.289043 spindlin 6.7 

129559 W01296 Hs.11360 hypothetical protein RJ14784 7,5 

129587 H14718 Hs.11506 Human done 23589 mRNA sequence 6.8 

129629 AK000398 Hs.11747 hypothetical protein RJ20391 3.8 

5 12S649 AO000092 Hs.16488 calreficulin 3.3 

129680 U03749 gb:Human chromogranin A (CHGA) gene, pro 14.1 

129689 AW748482 Hs.77873 B7homotog3 Z6 

129702 A1304966 Hs.12035 ESTs,Weakly similar to 138022 hypotheti 7.4 

129720 M156214 Hs.12152 APMCF1 protein 2 

10 130010 AA301116 Hs. 142838 nucleolar phosphoproteln Nopp34 1.6 

130097 AL046962 Hs.14845 forkhead box 03A £8 

130135 AA311426 Hs.21635 tubulin, gamma 1 6.1 

13021 1 NM_003358Hs.23703 ESTs, Moderately similar to CEGT_HUMAN C1.6 

130242 X79201 Hs.153221 synovial sarcoma, translocated to Xchro 5.4 

15 130359 NMJ)13449Hs.277401 bromodomaln adjacent to zinc finger doma 8.5 

130365 W56119 Hs.155103 eutaryotte translation initiation factor 11 

130448 BE513202 Hs.15589 PPAR binding protein 3.9 

130455 D90041 Hs.155956 N-a<styftransferase 1 {arylamine N-acety 33.6 

130471 AL121438 Hs.183706 adducin 1 (alpha) 2.7 

20 130503 BE208491 Hs.295112 WAA0618 gene product 16.1 

130511 L32137 Hs.1584 cartifage o^omeric matrix protein (pse 6.1 

130542 m€75 Hs.179825 RAN binding protein 2-iike 1 7.8 

130553 AF062649 Hs,252587 pituitary tumor-transforming 1 14.4 

130556 AI9G7018 Hs.15977 Empirically selected from AFFX single pr 4.7 

25 130567 AA383092 Hs.16Q8 replication protein A3 (14kD) 7.9 

130574 AF083208 Hs.16178 apoptosis antagonizing transition fee 1.2 

130617 M90516 Hs.1674 glutamine^m(^e^prK)sphatetransannin 1Z1 

130667 BE246961 Hs.17639 Homo sapiens ublquifin protein ligase(U 13.9 

130693 R68537 Hs.17962 ESTs 2 

30 130744 H59696 Hs.18747 POP7 (processing of precursor, S.cerevi 3.1 

130757 AL036067 Hs.18925 protein x 0001 5.7 

130880 BE514434 Hs.20830 kinesin-like 2 Zi 

130944 BE382657 Hs.21486 signal transducer and activator of trans 5.4 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 

35 131060 AA194422 Hs.22564 myosin VI 5.1 

131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxid 7 

131135 N!WL016569Hs.267182 TBXUso protein 3.3 

131185 BE280074 Hs.23960 cydinBI 5.8 

131225 H62087 Hs.31659 thyroid hormone receptor-associated prot 7.5 

40 131245 AL080080 Hs.24766 thloredoxln domain-contoinlng 2.8 

131283 X80038 Hs.339713 Homo sapiens clone F19374 APO E-C2 gene 1.3 

131569 AL389951 Hs.271623 nucleoporin 50kD 5 

131643 AW410601 Hs.30026 HSPC182 protein 2.9 

131714 AA642831 Hs.31016 putative DNA binding protein 2.9 

45 131722 013757 Hs.311 phosphoribosyl pyrophosphate amidotransf 3.4 

131760 X76732 Hs.3164 nucleoblndin 2 2.9 

131793 AW966127 Hs.32246 Homo sapiens cDNA FU14656 fis, clone NT 7.9 

131885 BE502341 Hs.3402 ESTs 13.7 

131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA.com 8.7 

50 131905 AA179298 Hs.3439 stomatin-fike 2 11.3 

131941 BE252983 Hs.35086 ubSqurtin specific protease 1 23 

131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5 

132180 NM_.004460Hs.418 fibroblast acfivation protein, alpha 14.7 

132203 NM_004782Hs.194714 synaptosomal-assodated protein, 29kD 7.8 

55 132273 AA227710 Hs.43658 DKFZP586L151 protein 10 

132288 N36110 Hs.305971 solute carrier My 2 {facilitated ghi 9.2 

132294 AB023191 Hs.44131 KIAA0974 protein 2 

132348 AW067708 Hs.170311 heterogeneous nuclear nbonucleoproteln 12.5 

132370 AW572805 Hs.46645 ESTs 28.3 

60 132384 AA312135 Hs.46967 HSPC034 protein 6.1 

132450 AA100012 Hs.48827 hypofoetical protein FU12085 8.6 

132465 AW169847 Hs.49169 K1AA1634 protein 6.1 

132532 AA454132 Hs.5080 rriitochondrial ribosomal protein L16 7.1 

132574 AW631437 Hs.5184 TH1 drosoprtfla homolog 14 

65 132638 AI796870. Hs.54277 DNA segment on chromosome X (unique) 992 12.4 

132718 NM_004600Hs.554 Sjogren syndrome antigen A2 {60kD, ribon 3.7 

132726 N52298 Hs.55608 hypoffietlcal protem MGC955 14.3 
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132731 A1189075 Ha.301872 hypothetical protein MGC4840 5.9 

132744 AA010233 Hs.55921 glutamyt-proIyt-tRNA synthetase 6.4 

132773 AA459713 Hs.295901 WAA0493 protein 14.6 

132798 AI026701 Hs.5716 KIAA0310 gene product 2.5 

5 132810 AB007944 Hs.5737 WAA0475 gene product 4.2 

132833 U78525 Hs.57783 eukaiyotic translation Initiation factor 6.1 

132842 NM 016154Hs.279771 Homo sapiens done PP1596 unknown mRNA7.1 

132851 U09716 Hs.287912 lectin, rnannose-binolng, 1 6.1 

132891 BE267143 Hs.59271 LT2(FMJ2) smaD nuctear RNA auxiliary fac 21 

10 132941 AI817165 Hs.6120 hypothetical protein RJ 13222 2.1 

132972 AA034365 Hs.288924 Homo sapiens cONA FU1 1392 fis, clone HE 3.5 

132980 AA040696 Hs.62016 ESTs 1.3 

132994 AA112748 Hs^79905 clone HQ0310PRO0310p1 17.1 

133016 AI439688 Hs.6289 hypothefical protein FU20886 4.4 

15" 133177 X97795 Hs.66716 RAD54 (S.cerevisiaeH<te 4.4 

133208 AI801777 Hs.6774 ESTs 5.5 

133254 AI567421 Hs.273330 Homo sapiens, done IMAGE:3544662, mRNA, 1.3 

133266 A1160873 Hs.69233 zinc finger protein 16.1 

133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2J1UMAN FORKH 12.2 

20 133285 M76477 Hs.289082 GM2 ganglioskle activator protein 10.4 

133390 AI950382 Hs.72660 phosphatidylserlne receptor . 5.7 

133391 AW103364 Hs.727 inhibin, beta A (actMn A, acfivln AB a 25.5 
133540 AL037159 Hs.74619 proteasome (prosome, macropaln) 26S subu 1.7 
133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cer 2.6 

25 133621 NfvL004893Hs.75258 H2A hlstone family, member Y 13.5 

133720 127841 Hs.75737 pericentriolar material 1 6.7 

133760 BE271766 Hs.181357 lamlnln receptor 1 (67kD, ribosomal prot 5.4 

133784 BE622743 Hs.301064 arfaptin 1 12.1 

133791 M34338 Hs.76244 spermidine synthase 9.7 

30 133797 AL133921 Hs.76272 retinobiastoma-binding protein 2 1.3 

133822 D50525 Hs.699 peptidylproiyl isomerase B (cydophirm 9.7 

133850 W29092 Hs.7678 ceiiular retinolc acid-binding protein 1 4.2 

133865 AB011155 Hs.170290 discs, large (Drosophlia) homolog 5 5 

133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin 9.1 

35 133924 066326 Hs.325948 vesicle docking protein pi 15 1.8 

133959 X81789 Hs.77897 splicing factor 3a, subunit3,60kD 10.4 

133989 AUQ40328 Hs.78202 SWI/SNF related, matrix associated, act! 2.6 

133997 A1824113 Hs.78281 regulator of G-protBin signalling 12 13 

134234 BE300078 Hs.80449 Homo sapiens, done 1MAGE:3535294, mRNA, 10.3 

40 134348 AW291946 Hs.82065 rnterteukfn 6 signal transducer (gp130, 6.7 

134376 X06560 Hs.82396 2\5^oGgoadenylate synthetase 1 (40-46 55 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 5.8 

134405 AW067903 Hs.82772 collagen, type XI. alpha 1 72.9 

134421 AU077196 Hs.82985 collagen, type V. alpha 2 6.7 

45 134480 NfVL005000Hs.fi3916 Empirically selected from AFFX single pr 6.2 

134516 AK001571 Hs373357 hypomefical protein FU10709 1.4 

134529 AW411479 Hs.848 FKSOS-bindhg protein 4 (59kD) 2.8 

134751 AW6308O3 Hs.89497 Iamin81 6.1 

134790 BE002798 Hs.287850 integral membrane protein 1 1.2 

50 134806 AO001528 Hs.89718 spermine synthase 2.6 

134850 AI701162 Hs.90207 hypothetical protein MGC1 1 138 9.1 

134859 026488 Hs.90315 WAA0007 protein ' 13.3 

134971 AI097346 Hs.286049 phosphoserine aminotransferase 2 

135181 BE250855 Hs.279529 pxWike protein 14.9 

5 5 135207 N26427 Hs.9634 ESTs, Hfchly similar to C1CLHUMAN PUTAT1 1.7 

135245 A1028767 Hs.262603 ESTs 12.2 

135257 AW291023 Hs.97255 ESTs, Weakly simflar to A46010 X-Bnked 7.6 

135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein 5.8 

135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3 

60 135354 AA456454 Hs.183418 cefl division cycle 2-Cke 1 (PITSLRE pr 5.7 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c 5.3 

317781 NM-007057HS.42650 ZW10 Interactor Z8 

321114 AA902256 Hs.78979 Golgl apparatus protein 1 55 

65 322556 BE041451 Hs.177507 hypothefical protein 2.9 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.3 

424001 W67883 Hs.137476 patBmafly expressed 10 (PEG10; K1AA105 7 
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425182 AF041259.HS.155040 

446999 AA151520 Hs.334822 

450701 H39960 Hs.288467 

452461 N78223 Hs.108106 

453157 AF077036 Hs.31989 



zinc finger protein 217 2.3 
hypothetical protein MGC4485 7.5 
Homo sapiens cDNA RJ12280 fe, done MA 5.6 
transcripfion (actor 4.7 
OKFZP586G1722 protein 1Z1 
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TABLE 5 A 

Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



10 Ptey: 

CAT number 
Accession: 



Unique Eos probeset Identifier number 
Gene duster number 
Genbank accession numbers 



15 Pkey 

123615 
124385 
110856 
20 120472 



25 



30 129019 44573 J 



CAT number Accessions 



35 



40 

122618 
125115 
120809 
45 129680 



50 

101045 
110501 
121558 
55 121911 



30686_-15 AA609170 
656394J AI267847 N27351 

19346.14 AA992380 N33063 N21418 H79958 R21911 H79957 

44573 2 AI950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 A1820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 A1872193 AI763273 AW173586 
AW150329 A1653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 
AA251669 AA251874 AI819225 AW205862 AI683338 A1858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002639 R67840 AA300207 AW959581 T63226 
F04005 

AI950087 N70208 R97040 N36809 A13081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 A1890387 AI950344 A1741346 A1689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 A1762688 M988777 AA488892 A1356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970BE612881 AW276997 AW513501 AW512843 AA044209 AW856538AA180009 AA337499 AW961101 
AA251669 AA251874 AW19225 AW205862 AI683338 AJ858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 A1564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

120695 9683 3 AA976503 AI917802AA953664AA404613AA428771 BE280542 AW1 94691 A1927301 AI740458 
AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 A1017004 AI241295 AA402816 AA291468 

305217 J AA453641AA454061 

genbanK_T97341 T97341 

genbanKJW346495 AA346495 

23162.1 U03749 NM.001275 J03463 J03915 A1214509 AW245744 AL046455 AA318960 A1741505 AA843875 AI829382 AJ560122 
AI858999 D55958 AJ684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA405741 A1091983 AA788784 AA706586 AA854361 AW470949 AA843095 M772028 AJ148432 
A1038109 AA782478 AA910064 A1220384 AA78 1236 AA843831 AA854064 AA843125 AA843419 AA31 9036 AA319054 
AI273831 W32275 A1584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 
AA894441 AI803081 A1167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 A1621 107 AI865540 AA772107 C06286 AA31 9661 AA405992 

enbe*J05614J05614 

genbanLH55748 H55748 

genbank u AA412497 AA412497 

genbankJ\A427950 AA427950 
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TABLE 6: Figure 6 from BRCA 001 US 



Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 

5 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Unigene number 

1 0 Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



15 ExAccn Unigene© UnlgeneTItle R1 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fascidln 7.5 

100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

101808 AA586894 Hs. 112403 S100 calcium-binding pro!e*>A7(psonas 8.9 

102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

20 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

105743 BE246502 Hs,9598 sema domain, immunoglobulin domain (Ig), 2.6 

105746 AW151952 Hs.46679 hypothetical protein FU20739 1.5 

106373 AW503807 Hs.21907 histone acetyttransferase 1.8 

110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

25 119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120297 AA191384 Hs.104072 ESTs. Weakly similar to 21 95.HUMAN ZINC 15.2 

120624 AW407987 Hs.173518 M-phase phosphoprotein homotog 52 

30 120695 AA976503 gb:oq30a04.s1 NCLCGAP GC4 Homo sapiens 46.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; K1AA1848 6.8 

121508 AA402515 Hs,B7667 ESTs 28 

122607 AA453518 Hs.98023 ESTs 61.5 

122616 AA453638 Hs.161873 ESTs 107.3 

35 122618 AA453641 gb:zx48e06.s1 Scares JestisJJHT Homo sap 31.1 

122829 AW204530 Hs.99500 ESTs 81.8 

122838 AA460584 Hs.334386 ESTs 75.3 

123753 M609955 Hs.234961 Huntingtin interacting protein E 30.6 

124385 AI267847 gb:aq49a10jc1 Stanley Frontal NB pool 2 57.1 

40 124860 R65763 Hs.101477 EST 23.9 

124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUBJ4UMAN Ml 22.8 

125051 T79956 Hs.100588 EST 135.3 

128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypept 53.9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 205 

45 129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6 

129689 AW748482 Hs.77873 B7homolog3 2.6 

130503 BE208491 Hs.295112 KIAA0618 gene product 16.1 

130511 L32137 Hs.1584 cartilage ollgomeric matrix protein (pse 6.1 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 

50 131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7 

132180 NivL004460Hs.418 fibroblast activation protein, alpha 14.7 

132370 AW572805 Hs.46645 ESTs 28.3 

132994 AA112748 Hs.279905 cibneHQ0310PRO0310p1 17.1 

55 133016 AI439B83 Hs.6289 hypothetical protein FU20886 4.4 

133266 AI160873 Hs.69233 zinc finger protein 16.1 

133391 AW103364 Hs.727 inhibin, beta A (activin A, active AB a 25.5 

134169 AJ690916 Hs.1781 37 transducer of ERBB2, 1 1.2 

134219 NM_000402Hs.80206 gtucose-6-phosphatedehydrogenase 1.9 

60 134405 AW067903 Hs.82772 collagen, type XI. alpha 1 723 

134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8 

134975 R50333 Hs.92186 Leman cotted-coll protein 2.6 

135181 BE250865 Hs.279529 px19-like protein 145 

322556 BE041451 Hs.177507 hypoftetical protein 2.9 
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TABLE 6 A 



Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. For 
5 each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DbubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Pkey CAT number Accessions 
124385 656394 J A1267847 N27351 

120695 9683.3 AA976503 AI917802 AA953664 AA404613 M428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 

AW052210 AA970201 AI633384 AA425910 AI017004 A1241295 AA402616 AA291468 
122618 305217J AA453641 AA454061 



Pkey: Unique Eos probeset Identifier number 

CAT number. Gene duster number 
Accession: Genbank accession numbers 
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TABLE 7: Figure 7 from BRCA 001-1 US 

5 Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Ptey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unlgene number 

Unigene Title: Unigene gene title 

R1; Ratio of tumor to normal breast tissue 



ORF struct info: Structural characterization of open reading frame for the sequence of the gene 




rKsy cXACcn 


Unigenelu 


UnigeneTitle 


R1 


ORF struct info 




rvo.On/ HQ 


nUoLHIlc (AJUUcllbaLKJll I 


O 3 


TM 


100114 X023Q8 




uiyuuuyiau? synutcioSo 


9Q 


other 


irwHi mo^fli 

lUUlOl UtZ^OO 


Me 11Q<;1 

ns.iiyoi 


ectonudeotide pyrophosphatase/phosphool 


4 a 


other 


100146 BF185499 


Ue 0471 


k"tAA(Y19n nana nmHitrt 


1 Q 


TM 


\\PJ\Hl UlJvOO 


ns.i 00040 


osieoDiasi speanc lacior i \rasaciin 


7 R 


other 


inniid HfiH79n 


ns.o \oq£. 


k'lnnH'fni nana nmWiir* 

iwvau iui gene proouu 


O 9 


other 




H« 194 

119. 1 ATT 


gene preaicieu irom cuiha wilji a complete 


1 A 
1.0 


other 


1 UUZxU MVVU 1 dwO*f 


II5.Z 1 / HsO 


dnnavln AO 


i 


other 


100265 038521 


Hs. 112396 


KIAA0077 protein 


1.5 


other 


100271 BE160081 


Hs.256290 


S100 calcium-binding protein A1 1 (cafcjiz 


13.5 


other 


100275 BE242802 


Hs.154797 


KIAA0090 protein 


5.1 


other 


100323 D50920 


Hs.23106 


KiAA0130gene product 


1.9 


TM 


100335 AW247529 


Hs.6793 


platelet-activating factor acetylhydroia 


2.7 


other 


100354 NM.004341 


Hs.154868 


carbamoyl-phosphate synthetase 2, aspart 


2 


omer 


100372 NMJJ14791 


Hs.184339 


K1AA0175 gene product 


2.6 


other 


100393 084145 


Hs.39913 


novel RGD-contalnlng protein 


3.2 


other 


100400 AW954324 


Hs.75790 


phosphatidyEnosrtol gfycan, class C 


1.5 


other 


100418 D86978 


Hs.84790 


KIAA0225 protein 


2 


other 


100482 M65028 


Hs.81361 


heterogeneous nuclear ribonucleoprotein 


2.9 


omer 


100518 NMJJ04415 


Hs.74316 ■ 


desmopIakIn(DPI,DPil) 


1.9 


other 


100666 L05424 


Hs.169610 


C044 antigen (homing function and Indian 


5.7 


other 


100667 L05424 


Hs.169610 


C044 antigen (homing function and Indian 


9 


? 


100668 L05424 


Hs.169610 


CD44 antigen (horning function and Indian 


7.7 


other 


100678 AW502935 


Hs740 


PTK2 protein tyrosine kinase 2 


53.2 


other 


100783 AF078847 


Hs.191356 


general transcription factor liH, polype 


6 


other 


100892 BE245294 


Hs.180789 


S164 protein 


1.7 


? 


100945 AF002225 


Hs.180686 


ublquitin protein ligase E3A (human papi 


1.5 


other 


100969 AA157634 


Hs.79172 


solute carrier family 25 (mitochondrial 


6.3 


other 


100988 AK000405 


Hs.76480 


ubiquitin4ke 4 


11.4 


? 


100999 H38765 


Hs.80706 


diaphorase (NADH/NADPH) (cytochrome b-5 


1.6 


other 


101031 J05070 


Hs.151738 


matrix metaltoprotelnase 9 (gelatinase B 


8.4 


other 


101045 J05614 




gb:Human proliferating cell nuclear antl 


5 


? 


101077 N99692 


Hs.75227 


Empirically selected from AFFX single pr 


2.6 


other 


101093 LQ6419 


Hs.75093 


procollagen-lysine, 2-oxoglutarate 5-dta 


1.4 


? 


101186 AA020956 


Hs.179881 


core-binding factor, beta subunit 


2 


TM 


101216 AA284166 


Hs.84113 


cyctin-dependent kinase inhibitor 3 (CDK 


1.8 


other 


101228 AA333387 


Hs.82916 


chaperonin containing TCP1 ( subunit 6A ( 


\3 


TM 


101247 AA132666 


Hs.78802 


glycogen synthase kinase 3 beta 


1.9 


other 


101249 L18964 


Hs.1904 


protein kinase C, iota 


1.5 


other 


101332 J04088 


Hs.156346 


topotsomerase (DMA) II alpha (170kD) 


5.3 


other 


101352 AI494299 


Hs.16297 


COX17 (yeasfl homolog, cytochrome c oxid 


4.2 


other 


101396 BE267931 


Hs.78996 


proliferating cell nuclear antigen 


1.9 


TM 


101445 M21259 




gb^uman Alu repeats in the region 5 to 


1.6 


TM 


101470 NM-000546 


Hs.1846 


tumor protein p53 (U-Fraumeni syndrome) 


2.5 


other 


101478 NMJJ02890 


Hs.758 


RAS p21 protein activator (GTPase activa 


5.5 


other 
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01483 M24486 Hs.76768 procofagefvprofine, 2-oxoglutarate 4-di 

01540 J04977 Hs.84981 X-ray repair complementing defective rep 

01573 AW248421 Hs.250758 proieasome (prosome, macropain) 25S subu 

01580 NMJM2151 H*83363 coagulafon factor Vlll-associated (Mr 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 

101621 BE391804 Hs.62661 guanyfete binding protein 1, Interferon- 

101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), reg 

01734 M74099 Hs.147049 cut(Dr^phitaHike 1 (CCAAT displacem 

01759 M80244 Hs. 184601 solute carrier family 7 (cattonte anuno 

01767 M81057 Hs.180884 carboxypepfeJase B1 (tissue) 

01782 AA306495 Hs.1869 ptephcgluoomutase 1 

01805 AW409747 Hs.75612 stress-irniuced-prtosphoprotein 1 (Hsp70VH 

01806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psonas 
01810 NM_0GQ318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 
01879 AA176374 Hs.243886 nuclear autoantigenic spemi protein (his 
01911 AA441787 H&1 19689 glycoprotein hrjrmones. alpha polypeptide 
01920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II 
101973 U41514 Hs.80120 UDP^cetyl-alpha^alacto3arnine:polyp 
02009 BE245149 Hs.82643 protein tyrosine kinase 9 

102036 BE250127 Hs.82906 COC20 (cell division cycle 20, S. cerevi 

02083 T35901 HsJ5117 interteuxin enhancer bindJng factor 2, 4 

02107 BE258602 Hs.182366 heat shock protBin 75 

02123 NM_001809 Hs.1594 centromere protein A (17kD) 

02165 BE313280 Hs.159627 death associated protein 3 

02198 AW950852 Hs.74598 polymerase (DNA directed), delta 2. regu 

02217 AA829978 Hs.301613 JTV1 gene 

02220 U24389 Hs.65436 lysosomal 

02234 AW163390 Hs.278554 heterochromatin-like protein 1 

02260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1 , impor 

02302 AA306342 Hs.69171 protein Wnase C-like 2 

02330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta 

02339 BE378432 Hs.95577 cycfirHiependent kinase 4 

02348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 

02349 AU077055 Hs.289107 bacutovlral IAP repeat-containing 2 
02369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 
02374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 
02391 AA296874 Hs.77494 deoxyguanosine kinase 

02455 U48705 HsJ5562 discoidin domain receptor family, member 

02465 NMJW1359 Hs.81548 2,4-oTerwyl CoA reductase 1, mltochondri 

02488 U50939 Hs.61828 amyloid beta precursor protein-binding p 

02489 AL080116 Hs.74420 origin recognition complex, subunit 3 (y 
02494 AI188137 Hs.75193 COP9r*rnotog 

02501 AF217197 Hs.74562 siah binding protein 1;FBP interacting 

02522 BE250944 Hs. 183556 solute carrier family 1 {neutral amino a 

02532 AF040253 Hs.70186 suppressordTy(S.cerevlsiae)5homolo 

02564 U59423 Hs.79067 MAD (mothers against decapentaptegic, Dr 

02568 W81489 Hs.223025 RAB31 , member RAS oncogene family 

02580 U60808 Hs.152981 CDP-diacylgiycero! synthase {phosphatida 

02581 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 

02582 U61232 Hs.32675 tubulin-spedfic chaperone e 

102617 AW161453 Hs.198767 COP9 (constitutive prwtonrorphogentc, Ara 

102618 AL037672 Hs.81071 extracellular matrix protein 1 
02627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-fike) 
I02663 NrVL002270 Hs.168075 karyopherin (Importin) beta 2 
02676 BE262989 Hs.12045 putative protein 

02687 NM_007019 Hs.93002 ubiqulfin carrier protein E2-C 

102689 U96132 Hs.171280 hydroxyacyWDoenzyme A dehydrogenase, ty 

102696 BE540274 Hs.239 forkhead box M1 

02704 AU077058 Hs.54089 BRCA1 associated RING domain 1 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 

102750 AB014460* Hs.66196 nfo (E.cofl endonudease lh>fike 1 

m BE252241 Hs.38041 pyrttoxai (pyrtdoxine, vitamin B6) krnas 

02812 U90549 Hs^36774 btgh-mobQity group (rwrihistone cruomoso 

02827 BE244588 Hs,6456 chaperonin containing TCP1, subunit 2 (b 

02844 AV653790 Hs^24275 WW domaincontaining protein 1 

02868 X02419 Hs.77274 plasminogen activator, urokinase 

02925 BE440142 Hs.2943 signal recognition particle 19kD 
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11^935 8 too 1850 


Ks. 80506 


small nuclear ribonudeoprotein poiypept 


2.4 


? 


102968 AU076611 


Hs.154672 


methylene tetrahydrofolate dehydrogenase 


2.7 


other 


102983 BE387202 


ft 1 J A A AAA 

Hs.1 18638 


norwnetastatic cells 1, protein (NM23A) 


3,1 


other 


102985 U95742 


Hs.2707 


G1 to S phase transition 1 


5.2 


? 


4nirt^^ A ft A If Ail J««A 

103023 AW500470 


Hs.1 17950 


multifunctional polypeptide similar to S 


1.6 


other 


103038 AA926960 


Hs.334883 


COC28 protein kinase 1 


2.5 


TM 


103060 NMJD05940 


Hs.155324 


matrix metallopjoteinase 1 1 (MMP1 1; stro 


4.5 


other 


103080 AU077231 


1 1 A A AAA 

Hs.82932 


cydln D1 (PRAD1: parathyroid adenomatos 


3.1 


other 


103089 D31152 


Hs.179729 


collagen, type X, alpha 1 (SchmW metaph 


2.4 


other 


103177 BE244377 


Hs.48876 


famesyWIphosphate famesyitransferase 


3.5 


other 


103178 AA205475 


Hs.275865 


fibosomat protein S1 8 


9.9 


? 


103179 NNL001777 


Hs.82685 


C047 antigen (Rh-related antigen, integr 


1.3 


other 


103181 X69636 


Hs.334731 


Homo sapiens, done IMAGE:3448306, mRNA, 


2 


other 


103185 NrVL.006825 


Hs.74368 


trarismembrane protein (63kD) t endopbsml 


1.6 


other 


103191 AA401039 


Hs.2903 


protein phosphatase 4 (formerly X), cata 


2.5 


other 


a /\o j in i a a a^/> a 

103193 NMJJ04766 


Hs.75724 


coatomer protein complex, subunit beta 2 


2.2 


TM 


103194 NM_004939 


Hs.78580 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


6.3 


TM 


103206 X72755 


Hs.77367 


monokine induced by gamma Interferon 


8.8 


TM 


103223 BE275607 


Hs.1708 


chaperonin containing TCP1, subunit 3 (g 


3 


other 


103232 X75962 


Hs.1 29780 


tumor necrosis {actor receptor superfami 


1.8 


other 


103238 AI369285 


Hs.75189 


death-associated protein 


5.6 


TM 


103297 NMJ01545 


Hs.9078 


immature colon carcinoma transcript 1 


1.9 


? 


103330 AI803447 


Hs.77496 


small nuclear ribonudeoprotein poiypept 


2.5 


other v 


103349 X89059 




gb:H.sapiens mRNA for unknown protein ex 


1.6 


other v 


103376 AL036166 


Hs.323378 


coated vesicle membrane protein 


1,8 


other 


103391 X94453 


Hs.1 14366 


pyrroiine-5-carboxyiate synthetase (glut 


2.3 


other 


103392 X94563 




gb:H.sapiens dbi/acbp gene exon 1 & 2. 


4 


TM 


103430 BE564090 


Hs.20716 


translocase of inner mitochondrial membr 


1.3 


other 


103491 AF264750 


Hs.288971 


myeloid/tymphoid or mixed-lineage leukem 


5.7 


? 


103505 AL031224 


Hs.33102 


transcription factor AP-2 beta (activati 


5.1 


other 


103547 AJ376722 


Hs.180062 


proteasome (prosome, macropain) subunit, 


9.7 


7 


103588 NM-006218 


Hs.85701 


phosphoinositide-3-kinase, catalytic al 


2 


other 


103613 NMJJ00346 


Hs.2316 


SRY (sex determining region Y)iox 9 (ca 


1.3 


? 


103621 BE379766 


Hs.150675 


polymerase (RNA) II (DNA directed) polyp 


2 


other 


103622 AA609685 


Hs.278672 


membrane component, chromosome 11, surfa 


2.3 


TM 


103727 AI878883 


Hs.296381 


growth factor receptor-bound protein 2 


1.3 


other 


103754 AKJ15709 


Hs.172089 


Homo sapiens mRNA; cDNA DKFZp586i2022 (f 


1.3 


other 


103780 M094752 


Hs.169992 


hypothetical 43.2 Kd protein 


7.6 


? 


103795 H26531 


Hs7367 


Homo sapiens BTB domain protein (BOPL) m 


1.3 


SS.TM 


103797 M080912 




gb:znQ4d03.r1 Stratagene hNT neuron (937 


1.6 


other 


103813 AI042582 


Hs. 181271 


CGI-120 protein 


1.6 


other 


103855 W02363 


Hs.302267 


hypothetical protein FU10330 


1.6 


other 


103886 AK001278 


Hs.105737 


hypothetical protein FU 10416 similar to 


6.6 


TM 


104052 NM_0G2407 


Hs.97644 


mammagiobin 2 


2.9 


other 


104079 AA251242 


Hs.103238 


ESTs 


1.4 


other 


104174 AA478984 


Hs.6451 


PRO0659 protein 


5.6 


TM 


104227 AB002343 


Hs.98938 


protocadherin alpha 9 


1.6 


other 


104275 AI751970 


Hs.101067 


GCN5 (general control of amino-acid synt 


5.4 


other 


104325 BE379766 


Hs.150675 


polymerase (RNA) II (DNA directed) polyp 


6.4 


other 


104370 AA324597 


Hs.21851 


Homo sapiens cDNA HJ1290Q fis, clone NT 


1.6 


other 


104423 RS3113 


Hs.1432 


protein kinase c substrate BUK-H 


52 


other 


104482 AB037762 


Hs.44268 


myelin gene expression factor 2 


1.2 


other 


104667 AI239923 


Hs.30098 


ESTs 


1.4 


other 


104757 AI694413 


Hs.332649 


olfactory receptor, family 2, subfamily 


2.4 


other 


104804 AI858702 


Hs.31803 


ESTs, Weakly similar to N-WASP [Ksapien 


1.4 


other 


104806 AB023175 


Hs.22982 


K1AA0958 protein 


2.4 


other 


104827 AW052006 


Hs.8551 


PRP4/STK/WD splicing factor 


10.9 


other 


104846 A1250789 


Hs.32478 


ESTs 


5.7 


other 


104854 AA041276 


Hs.154729 


3-phosphoinositide dependent protein kin 


123 


? 


104867 AA278898 


Hs.225979 


hypothetical protein similar to small G 


2.1 


other 


104871 T78044 


Hs.28893 


Homo sapiens mRNA; cDNA DKFZp56402364 (f 


1.4 


other 


104896 AW015318 


Hs.23165 


ESTs 


17.7 


other 


104909 AW408164 


Hs.249184 


transcription factor 19 (SC1) 


5.1 


TM 


104916 AW958157 


Hs.155489 


NS1 -associated protein 1 


1.8 


other 


104919 AA026880 


Hs.25252 


prolactin receptor 


1.5 


other 


104930 AF043467 


Hs.32893 


neurexophGin 2 


2.3 


other 


104973 NMJJ15310 


Hs,6763 


K1AA0942 protein 


5.1 


other 
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04974 Y12059 


Hs.278675 


bromodomain-containing 4 


1.5 


other 


04975 AL136877 


Hs.50758 


SMC4 (structural maintenance of chromoso 


24 


other 


04978 AI199268 


Hs.19322 


Homo sapiens, Similar to RIKEN cOMA 2010 


7.3 


other 


04979 AA937934 


Hs.321062 


ESTs 


1.3 


other 


04994 AM99930 


Hs.334885 


mitochondria} GTP binding protein 


3.6 


? 


05009 BE379584 


Hs.34789 


dolichyWlphosphoofigosaccharide-pnDtei 


5.6 


other 


05012 AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


3.4 


other 


105028 A1050715 


Hs.2331 


E2F transcription factor 5, p130-binding 


2.2 


other 


105041 AB037716 


Hs.26204 


WAA1295 protein 


22 


other 


05045 BE242899 


Hs.129951 


speclde-type POZ protein 


3.9 


? 


05079 AA151342 


Hs.12677 


CGM47 protein 


9.5 


TM 


105087 AA147884 


Hs.9812 


Homo sapiens cDNA FU 14388 (is, done HE 


5.7 


other 


105088 H58589 


Hs^5156 


Homo sapiens cDNA FU11027 fis 1 clone PL 


22 


ofter 


105095 Z78407 


Hs^7023 


vesicle transport-related protein 


2.2 


other 


105110 BE387350 


Hs33122 


KIAA1160 protein 


1.6 


other 


105126 AW975433 


Hs.36288 


ESTs 


6.4 


? 


105127 AA045648 


Hs.301957 


nudix (nucleoside diphosphate finked moi 


22 


other 


105141 AA164687 


Hs.177576 


mannosyl (aipha-1 ^-glycoprotein beta- 


2.8 


other 


105158 AW976357 


Hs.234545 


hypothetical protein NUF2R 


2 


other 


105169 BE245294 


H5.180789 


S164 protein 


1.7 


other 


105186 AA191512 


Hs.28005 


Homo sapiens cDNA RJ11309 fis, done PL 


4.9 


SS.TM 


105254 AA071276 


Hs.19469 


WAA0859 protein 


2 


TM 


105281 AA263143 


Hs.24596 


RAD51 -interacting protein 


2.9 


? 


105288 N99673 


Hs.3585 


ESTs, Weakly similar to AF126743 1 DNAJ 


1.9 


TM 


105302 AA700122 


Hs.3355 


sentrin-specific protease 


82 


? 


105331 AW270037 


Hs.179507 


K1AA0779 protein 


1.8 


ss, 


105359 NM.016015 


Hs.8054 


CG1-68 protein 


8.4 


other 


105366 BE264645 


Hs.282093 


hypothetical protein FU21918 


5.1 


other 


105373 AW887701 


Hs.32356 


hypothetical protein FU 20628 


2.6 


other 


105374 BE242803 


Hs^62823 


hypothetical protein FU10326 


22 


TM 


105387 AW592146 


Hs.108636 


membrane protein CH1 


2.3 


SS.TM 


105393 AF167570 


Hs.256583 


Interieukin enhancer binding factor 3, 9 


5.5 


SS, 


105399 BE386877 


Hs.334811 


Npw38-blnding protein NpwBP 


1.6 


other 


105400 AF198620 


Hs.65648 


RNA binding motif protein 8A 


1.6 


other 


105445 AA252395 




gb2s12g10.s1 NCLCGAPJ3CB1 Homo sapiens 


5.1 


? 


105507 BE268348 


Hs.226318 


CCR4-NOT transcription complex, subunit 


1.6 


other 


105529 AA1 13449 


Hs.32471 


hypothetical protein FU 20364 


1.3 


other 


105530 AB023179 


Hs.9059 


K1AA0962 protein 


3.5 


other 


105547 AA262640 


Hs.27445 


unknown 


9.3 


other 


105564 BE616694 


Hs.288042 


hypothetical protein FU14299 


1.4 


other 


105596 AA579535 


Hs,18490 


hypothetical protein FU20452 


10.9 


TM 


105597 AF054284 


Hs.334826 


splicing factor 35, subunit 1, 155fcD 


Z9 


TM 


105608 AI808201 


Hs.287863 


hypothetical protein FU12475 


1.7 


? 


105610 AA280072 


Hs.99872 


fetal Alzheimer antigen 


1.4 


other 


105617 AK000892 


Hs.4069 


glucocorticoid modulatory element bindin 


1.7 


TM 


105620 AW302245 


Hs.181390 


casein kinase 1, gamma 2 


5.6 


other 


105658 AA985190 


Hs.246B75 


hypothetical protein FU2O059 


9.4 


other 


105697 AW499988 


Hs.27801 


zinc finger protein 278 


2 


TM 


105708 R26944 


Hs.180777 


Homo sapiens mRNA; cONA OKFZp564W0264 (f 


1.7 


other 


105743 BE246502 


Hs.9598 


sema domain, immunoglobulin domain (lg), 


2.7 


other 


105746 AW151952 


Hs.46679 


hypothetical protein FU20739 


1.5 


? 


105759 AI123118 


Hs.15159 


chemoldne-tike factor, alternatively spi 


U 


other 


105771 AI267720 


Hs.153221 


synovial sarcoma, translocated to X chro 


1.6 


other 


105820 AA741336 


Hs.152108 


lranscriptonalunltN143 


22 


other 


105826 AA478756 


Hs.194477 


E3ubfquifintigaseSMURF2 




other 


105856 AI262106 


Hs.12653 


ESTs 


24 


other 


105858 AF151066 


Hs.281428 


hypothetical protein 


29 


other 


105875 AK001708 


Hs.32271 


hypothetical protein FU 10846 


1.4 


other 


105930 AF016371 


Hs.9880 


pepUdyt prolyl isomerase H (cydophlDn 


5.3 


other 


106000 AW194426 


Hs.20726 


ESTs 


1.7 


other 


106011 AW081202 


Hs.12284 


Homo sapiens, clone 1MAGE:2989556, mRNA. 


28 


other 


106017 AA477956 


Hs.26266 


ESTs 


1.4 


other 


106073 AL157441 


Hs.17834 


downstream neighbor of SON 


1.4 


other 


106078 AA130158 


Hs.19977 


ESTs, Moderately similar to ALU8_HUMAN A 


16 


? 


106094 AA533491 


HsJ8317 


hypoflietteal protein FU 14681 


6.9 


other 


106140 AB006624 


Hs.14912 


WAA0286 protein 


1.6 


other 


106271 AA251393 


Hs.289052 


Homo sapiens, Similar to RIKEN cDNA 5430 


10.8 


? 
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106288 AB037742 


Hs.24336 


WAA1321 protein 


1.3 


other 


106300 Y10043 


Hs.19114 


high-mobility group (nonhistone chramoso 


3.7 


other 


106333 AL043114 


Hs.22410 


ESTs, Weakly similar to A54849 collagen 


5.5 


SS, 


106350 AKD01404 


Hs.194698 


cycGnB2 


5.8 


other 


106359 AW390282 


Hs.31130 


transmembrane 7 superfamify member 2 


6.4 


other 


106381 AB040916 


Hs.24106 


KIAA1483 protein 


6.6 


other 


106389 AW748420 


Hs.6236 


Homo sapiens cDNA: FU21487 fis, clone C 


12 


TM 


106457 AF1 19256 


Hs.27801 


zinc finger protein 278 


2.7 


other 


106470 D63078 


Hs.186180 


Homo sapiens cDNA: RJ23038 fis, clone L 


2.3 
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8.3 


? 


121729 AI949597 


Hs.98325 


ESTs 


1.8 


TM 


121731 AA421041 


Hs.180744 


ESTs 


4.1 


TM 


121744 AA398784 


Hs.97514 


ESTs 


7.1 


SS, 


121748 BE536911 


Hs.234545 


hypothetical protein NUF2R 


19.5 


other 


121773 AB033022 


Hs.158654 


WAA1196 protein 


8 


other 


121775 AA421773 


Hs.161008 


ESTs 


1.7 


other 


121776 AA292579 


Hs.125133 


hypothetical protein FU22501 


6.7- 


other 


121786 AI810774 


Hs.98376 


ESTs 


10.5 


other 


121832 AW340797 


Hs.98434 


ESTs 


5.9 


other 


121836 AA328348 


Hs.218289 


ESTs 


3.9 


other 


121839 AA425691 


Hs.191606 


ESTs, Highly similar to K1AA1048 protein 


5 


other 


121842 AF027406 


Hs.104865 


serine/threonine kinase 23 


2.7 


? 


121847 AA446628 


Hs.2799 


cartilage linking protein 1 


2.3 


other 


121871 AW972668 


Hs.293044 


ESTs 


2.9 


TM 
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121882 AA426376 


Hs.98459 


ESTs 


5 


. other 


121911 AA427950 




gkzwSOflWLsl Scares JotaLfetiJS_Nb2Hf8_ 


7.3 


TM 


121915 M428179 


Ks.223405 


ESTs, Moderately similar to A46010 X-fln 


2.5 


other 


121935 AA428647 


Hs.98611 


EST 


23 


other 


121983 AA298760 


Hs.180191 


hypoftefical protein FU14904 


3.4 


other 


121985 AI8S2570 


Hs.299214 


Homo sapiens, clone IMAGE:2822295, mRNA, 


11.4 


other 


121995 AA210863 


Hs.3532 


nemo-Oke kinase 


3.8 


? 


121999 AA430211 


Hs.98668 


EST 


6.5 


other 


122009 AW292763 


Hs.160822 


Homo sapiens cONA: BJ20863 fa, clone A 


2.2 


other 


122013 AA431085 


Hs.98706 


ESTs 


6.6 


other 


122036 W92142 


Hs.271963 


ESTs, Weakly similar to ALU5_HUMAN ALU S 


13,1 


other 


122050 A1453G76 


Hs.166109 


ELAV (embryonic lethal, abnormal vision, 


9.1 


other 


122060 AA431738 


Hs.98750 


EST 


13.1 


? 


122114 AW161023 


Hs.104921 


ESTs 


1.5 


other 


122188 AA398838 




gb2t80d01.r1 SoaresJestisJW Homo sap 


3.4 


other 


122204 AA435936 


Hs.98642 


EST 


5.6 


other 


122246 AA329550 


Hs.29417 


HCF-binding transcripton factor Zhangfe 


5.2 


other 


122257 AA436819 


Hs.98899 


ESTs 


5.6 


other 


122302 AA441801 


Hs.104947 


ESTs 


5.8 


other 


122341 AW601969 


Hs.99010 


hypothetical protein FU 22263 similar to 


2 


other 


122356 AA443794 


Hs.98390 


ESTs 


7.4 


SS.TM 


122369 AA443985 


Hs.303222 


ESTs 


12.2 


? 


122371 AA868555 


Hs,178222 


ESTs 


5 


? 


122372 AA446008 


Hs.336677 


EST 


7.8 


? 


122378 AB032948 


Hs.21356 


hypothetical protein DKFZp762K2015 


2.5 


? 


122405 AA446572 


Hs.303223 


EST 


2.8 


TM 


122412 AA446869 


Hs.119316 


ESTs 


7.4 


other 


122415 AA446918 


Hs.99088 


EST 


1.9 


other 


122418 AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K 


6.9 


? 


122440 AW505139 


Hs.9460 


Homo sapiens mRNA; cONA OKFZp547C244 (fir 


2.6 


other 


122446 AA447603 


Hs.99123 


EST 


1.8 


TM 


122448 AA447626 


Hs.99127 


EST 


3.5 


other 


122458 AI266159 


Hs.104980 


ESTs 


1.5 


other 


122460 AW418788 


Hs.99148 


ESTs. WeaWy similar to S43569 R01H10.6 


9.7 


other 


122464 AA448158 


Hs.99152 


EST 


4.9 


other 


122490 AA448349 


Hs.238151 


EST 


Q2 


? 


122492 AA448417 


Hs.104990 


ESTs 


5.5 


other 


122502 AA204969 


Hs.234863 


Homo sapiens cDNA FU12082 fis, clone HE 


1.3 


other 


122510 AA449232 


Hs.99195 


ESTs 


112 


? 


122530 AW959741 


Hs.40368 


adaptor-related protein complex 1 , sigma 


10,1 


other 


122547 AA779725 


Hs.164589 


ESTs 


2.5 


SS. 


122555 AA194055 


Hs.293858 


ESTs 


1.9 


other 


122570 AA452578 


Hs.262907 


ESTs 


9.5 


other 


122572 AA452601 


Hs.99287 


EST 


11 


? 


122586 AK001910 


Hs.99303 


Homo sapiens cONA FU1 1048 fis, clone PL 


3.4 


other 


122587 AB040893 


Hs.6968 


WAA1460 protein 


2 


other 


122598 AI028173 


Hs.99329 


ESTs 


1.7 


? 


122599 AL355841 


Hs.99330 


hypothetical protein FU23588 


4.4 


? 


122602 AA411925 


. Hs.301960 


ESTs 


4.7 


other 


122607 AA453518 


Hs.98023 


ESTs 


615 


other 


122614 AA4 53530 


HS.99339 


COT 

EST 


•10 7 
W.I 


r 


122616 AA453638 


Hs.161873 


ESTs 


107.3 


? 


122617 AI681535 


Hs.148135 


serine/threonine Idnase 33 • 


121.4 


other 


122618 AA453641 




gbzx48e06.s1 SoaresJestisJJHT Homo sap 


31.1 


SS ( 


122622 AA453987 


Ks.144802 


ESTs 


5.6 


other 


122717 AA456859 


Hs.178358 


ESTs 


8.5 


SS, 


122762 AI376875 


Ks.105119 


ESTs 


10.4 


other 


122829 AW204530 


Hs.99500 


ESTs 


81.8 


? 


122834 AA461492 


Hs.99545 


Homo sapiens cDNAFU 10658 fis, done NT 


3.7 


? 


122836 AA460581 


Hs.290996 


ESTs 


4.6 


other 


122837 AA461509 


Hs.293565 


ESTs, Weakly similar to putative p150 [H 


Z7 


TM 


122838 AA460584 


Hs.334386 


ESTs 


75.3 


other 


122854 AA600235 


Hs.9625 


NIMA (never in mitosis gene a)-rdated k 


7.8 


other 


122856 AI929374 


Hs.75367 


Src-fike-adapter 


5.8 


other 


122B61 AA335721 


Hs.119394 


ESTs 


U 


other 


122866 BE539656 


Hs.283705 


ESTs 


4.2 


other 


122868 AF005216 


Hs.1 15541 


Janus kinase 2 (a protein tyrosine kinas 


5.3 


other 
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122870 AW576312 


Hs.318722 


Homo sapiens cDNA: FU21766 fis, done C 


9.9 


? 


122872 AW081334 


Hs.97103 


ESTs 


5.3 


other 


4f1fl1A A A *1 A A A A 1 A 

122879 AA769410 


l l _ a An r* w a 

Hs.128654 


ESTs 


13.9 


other 


122907 AA470074 


1 1 A AAAAA 

Hs.169896 


ESTs 


11.5 


other 


1/2916 AA47Q14Q 


Hs .229 I/O 


EST 


1.7 


TM 


122981 AA478951 


Hs.105629 


ESTs 


5 


other 


123013 AW968324 


Hs. 17384 


ESTs 


15.4 


other 


123016 AW338067 


Hs.323231 


Homo sapiens cDNA FU1 1946 fis, clone HE 


2.8 


other 


123034 AL359571 


Hs.44054 


nbieln (GSK3B interacting protein) 


8.7 


other 


123072 AI382600 


Hs.1 04308 


ESTs. Weakly similar to K1AA1395 protein 


8.8 


other 


A /MAn^ A A JAr*AAA 

123082 AA485350 


Hs.105661 


ESTs 


4 


? 


123088 AI343652 


Hs. 105667 


ESTs 


3.8 


other 


123110 AA486256 


Hs.193510 


EST 


7.4 


other 


a aa a a l nr*AAiAiA 

123114 BE304942 


Hs.265848 


myomegalln 


2.8 


? 


A AA A f\A Yr*AAil*ff 

123131 T52027 


Hs.271795 


ESTs, Weakly slmBar to 138022 hypotheti 


2.4 


other 


123132 AI061582 


1 1 An A A ^A 

Hs.324179 


Homo sapiens cONA FU12371 fis, clone MA 


15.6 


TM 


123136 AW451999 


II AAA A A A 

Hs.194024 


ESTs 


5.2 


other 


123149 AI734179 


Hs. 105676 


ESTs 


23.8 


TM 


123152 AW601773 


Hs.270259 


ESTs 


5.2 


other 


A AAAm A A InAAAA 

123258 AA490929 


1 \ A AfAt A 

Hs.105274 


ESTs, Weakly similar to RMS1JHUMAN REGUL 


9.3 


? 


123315 AA496369 




gfczv37d10.s1 Soares ovary tumor NbHOT H 


4.2 


TM 


123369 AA504757 


Hs. 105738 


ESTs 


7 


other 


J AAOAJ A A*TAA AAA 

123394 AA731404 


Hs.105510 


ESTs 


3.7 


other 


123433 AW450922 


Hs.1 12478 


ESTs 


3.8 


other 


123466 AA599042 


Hs.1 12503 


EST 


7.4 


other 


123470 AW303285 


Hs.303632 


Human DNA sequence from done RP11-110H4 


3.5 


other 


123471 AB021644 


Hs.197219 


zinc finger protein 14 (KOX 6) 


5.2 


? 


123475 BE439553 


Hs.250528 


Homo sapiens, done IMAG£:4098694, mRNA, 


1.7 


other 


123482 N95Q59 


Hs.55098 


ESTs 


1.6 


other 


123486 BE019072 


Hs.334802 


Homo sapiens cDNA RJ 14680 fis, done NT 


2.4 


other 


123508 AW380388 


ll a rpr A A 

Hs.1 55546 


KIAA1080 protein; Golgi-assodated, gamm 


2.2 


TM 


123615 AA609170 




gbaf12a12,s1 SoaresJestisJJHT Homo sap 


7.9 


other 


123619 AA602964 




gb:no97c02s1 NCLCGAPJ>r2 Homo sapiens 


2.8 


other 


123658 AA609364 




gb2u71d09.s1 SoaresJestisJIHT Homo sap 


17 


? 


A A** A A IAAA/«An 

123674 AI269609 


Hs.105187 


kinesin protein 9 gene 


5.7 


? 


123735 NNL013241 


Hs.95231 


FH1/FH2 domain-containing protein 


10 


other 


123738 AA609891 


Hs.1 12777 


EST 


5.2 


other 


123753 AA609955 


Hs.234961 


Huntinglin interacting protein E 


30.6 


TM 


123804 AA620464 


Hs.261915 


EST, Weakly similar to S65657 alpha-1 C-a 


2.1 


other 


123811 AA620586 




gfcae60g05.s1 Stratagene king carcinoma 


2.7 


other 


a AOArd a r\f\A aaaa 

123951 AB012922 


Hs.1 73043 


metastasis-associated Hike 1 


6.3 


? 


123983 AJ272267 


Hs.146178 


choline dehydrogenase 


4.4 


other 


124001 L42542 


Hs.75447 


ra(A binding protein 1 


7.1 


? 


124006 AI147155 


Hs.270016 


ESTs 


8.3 


SS, 


124070 AI950314 


Hs.154762 


HIV-1 rev binding protein 2 


3.8 


other 


124074 H05635 


Hs.294030 


topoisomerase-rBiated function protein 4 


U 


SS, 


124178 BE463721 


Hs.97101 


putative G protein-coupled receptor 


3.2 


? 


124203 AA372796 


Hs.269339 


ESTs, Weakly slmBar to AF161356 1 HSPC0 


5.7 


other 


124352 AA640891 


Hs.1O2406 


ESTs 


3.1 


TM 


124375 D87454 


Hs.192966 


K1AA0265 protein 


3.5 


other 






go.aq49alQ.xl Stanley Frontal Nd pool i 


57.1 


7 


124390 AA317338 


Hs.7535 


COSW-iike protein 


2.8 


other 


124391 AF155099 


Hs.279780 • 


NY-REN-18 antigen 


7.1 


other 


124417 N34059 




gb:yv28h09^1 Soares fetal Tn/er spleen 


3.3 


other 


124428 H13540 


Hs.82202 


ribosomal protein L17 


29 


other 


124440 AA532519 


Hs.129043 


Human DNA sequence from done 989H1 1 on 


7.9 


other 


124466 R10084 


Hs.113319 


kinesin heavy chain member 2 


26 


TM 


124482 N53935 




gb:yv59d09.s1 Soares fetal fiver spleen 


7.9 


TM 


124498 H79433 


Hs.268997 


ESTs 


7.8 


other 


124515 AA669097 


Hs.109370 


ESTs 


3.3 


other 


124608 N71076 


Hs.102800 


ESTs, Weakly similar to neuronal thread 


4.6 


? 


124631 NMJM4053 


Hs.270594 


FLVCR protein 


3.2 


other 


124634 AI765123 


Hs.143671 


Homo sapiens cDNA FU13533 fis, done PL 


5.8 


other 


124637 AA160474 


Hs.75798 


hypothetical protein 


9.3 


other 


124542 AW968856 


Hs.278569 


sorting nexin 17 


3.5 


other 


124649 N92593 


Hs.313054 


ESTs 


6.1 


TM 


124661 R48170 


Hs.78436 


EphB1 


5.6 


other 
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19/1GQ9 


AAooiool 


U„ 4 4 00.70 

HS.119o78 


cotb uImUu i>MitUrtn kawq L11 imam Mrrrv 

cols, weakly santar to ivuKyjiUMAN mi 1 Vh 


7 0 

7.9 


IM 


4J9A740 

u4712 


orvncc 

K0910O 


11. 40-44/40 

Hs,19114o 


cols 


C 7 

5.7 


other 


19479C 
IZ4r30 




nS.2oobo5 


tolS 


11-0 


r 


u4761 


A A47A7CC 

AA374756 


Hs.93560 


UamahhIhiw -DMA I/I A A 4T74 nmLU 

Homo sapiens mKNA tor kiaa 17/1 protein, 


A 

9 


other 


124768 


a mneocoa 

AW36B528 


Hs, 100855 


ESTs 


83 


other 


1^4/75 


R41772 


tl _ 4An070 

nS.IUQovo 


CO IS 


A O 

4.9 


other 


49A777 
144/77 


K41933 


U. 4 AMU 

HS.14U2J7 


CCTr \AfniLfil olmtlor fn Al 1 H Lit IMAM Al 1 1 C 

cols, Weakly similar to ALU! J1UMAN ALU o 


2.0 


other 


124788 


R43543 


HS.1QQ912 


Homo sapiens cuNA: rLuZJ^o its, clone H 


5.1 


other 


124809 


Al OCCTV» 

AL355722 


Hs. 106875 


Homo sapfens tor nom clone 35214, run 


42 


other 


49AQ44 
1^401 1 


Dicftco 


t|. 9O0O49 

nS.2oo912 


nypotneucai protem rUz2oL/4 


AA O 


other 


124812 


R47948 


1] _ 4 00700 

HS.1 88732 


ESTs 


7.9 


other 


124822 


A A 44ft4<V*l 

AA418160 


Hs.86043 


1 1 - ___t^.^— -t\ki a d 14 1 ceo Co aU«» m 

Homo sapiens cONA PU13550 ns, clone PL 


6.6 


other 


124825 


a A r r\4 n/w 

AA501669 


Hs.336693 


ESTs 


2.3 


SS.TM 


124833 


AW975868 


Hs-294100 


ESTs 


2.7 


SS.TM 


124857 


R63652 


Hs.137190 


ESTs 


2.3 


other 


124860 


R65763 


Hs.1 01477 


EST 


23.9 


7 


124863 


AJ 382555 


1 |_ a nmc A 

Hs.1 27950 


bromodornaln-containing 1 


2 


other 


124876 


Ar 135422 


Hs.27059 


GDP-mannose pyrophosphoryfase A 


4.4 


SS, 


124878 


BE397530 


Hs.288057 


hypolheOcal protein FU 22242 


2.7 


other 


124902 


H37941 


II— «i A4I AO O 

Hs.101883 


ESTs 


5.7 


other 


124903 


AW2S6713 


1J_ 004 AAA 

HS221441 


ESTs 


32.4 


other 


124930 


AI076343 


Hs.1 73939 


EST s, Weakly Similar 10 ALUBJiUMAN 


22.8 


other 


124942 


R99978 


Hs.268892 


ESTs, Moderately similar to B34087 hypot 


6.1 


other 


124958 


AI078645 


Hs.431 


murine leukemia viral (bmM) oncogene h 


1.9 


other 


124980 


T40841 


Hs.98681 


ESTs 


4.5 


? 


125002 


T59338 


Hs .269463 


ESTs, Weakly similar to ALUi _human ALU o 


A ft 

4.9 


other 


125047 


T79815 


Hs.279793 


ESTs 


5 


7 


125051 


T79956 


IJ_ 4AACO(1 

Hs.1 00588 


EST 


4.4C 1 

135.3 


7 


125056 


T81310 


|J_ 4 AAC<V> 

Hs.1 00592 


ESTs 


C A 

5.4 


other 


125101 


AI472Q68 


[_|_ <>QC*WC 

Hs.2oo23o 


1/1 A A 4 OCfi nmlnin 

KlAAiooo protem 


D.O 


other 


125113 


T96595 


HS.302270 


rpT„ l*l*.«LL« *J«i!L»lu. Al t IC LJ1 ItJAkl till 

ESTs, WeaWy SlmHarto ALUr JiUMAN III! 


1.8 


other 


125115 


T97341 




gbiye57e05^1 Scares fetal flver spleen 


9.0 


r 


125125 


AI222382 


HS.24Q767 


II . I, nMA jl-.j i- f---.i J-.-L-L PQ1 4*Sr*4A 

Human UNA sequence tram done rp 1-120 14 


4 c 

1.0 


IM 


125147 


Wool 50 




Empirically selected from AFFX single pr 


4 7 
1./ 


f 


125161 


\kiAACCrr 

W44657 


HS.1 44232 


COT 

col 


4ft 7 


9 
f 


4 0CO/IO 


a a Mftoeo 


HS.1 Jl J/0 


CQTc KAnriamiahi clmttar tn Al 1 IR Ml IMAM 1 

co i s, Mooeraiety simiwr 10 MLUo^UiviMr* i 


1.0 


omer 


125255 


Aru9olo2 


11- 4 4 0£H 

HS.1 18631 


timeless (DrosophBa) homotog 


9.4 


otner 


125279 


AW4U1CU9 


M- 4-7-70 

HS.47/9 


1/1 A A 44 CO nmla'm 

kiaahoo protem 




r 


125280 


AM117AC 

Al 12 J/ 05 


HS.IUoSJz 


COlS 


0.1 


9 


125298 


AtA/n7*)CA9 

AW97Z542 


Lin ooonno 


Homo sapiens cuna. rU2 1014 ns, cone n 


A C 
1.0 


other 


49CCCA 


AIAI909474J 

AW2921/1 


HS.2J9/0 


scaffold attachment factor B 


0.9 


other 


125827 


MM ftftlvtftO 

NIVVJUU34U3 


HS.9743b 


YY1 transcription tactor 


4 n 
1.4; 


r 


125891 


i ion coo 
U29569 


Hs.7138 


cholinergic receptor, muscarinic 3 


0.5 


9 
r 


126005 


itifJ ftft7ft4. 

AW4097Q1 


Hs.1578 


bacuiovirai LAP repeat-containing 5 (sur 


AA O 
14^ 


9 


126202 


AA1 57632 


Hs.2 72630 


vacuolar proton pump delta polypeptide 


2.5 


SS, 


126695 


AA643322 


Hs.172028 


a dtslntegrin and metalloprotelnase doma 


9.1 


SS.TM 


127050 


AW411066 


Hs.274351 


CGI-89 protein 


17 


other 


127274 


AW966158 


Hs.58582 


Homo sapiens cDNA FU12789 lis, clone NT 


12,8 


other 


128355 


AW293012 


Hs.161623 


ESTs 


7.4 


SS, 


128493 


D87466 


Hs.240112 


KIAA0276 protein 


3.1 


TM 


4O0C99 




nS.iUU9o 


puiauve nucleolar kina neiicase 


Q A 
9.*t 


other 


128527 


AA504583 


Hs.101047 


transcription factor 3 (E2A tmmunoglobu! 


1.5 


other 


128528 


R39234 


Hs.251699 


ESTs, Weakly similar to iDN4-GG 1 n14 [H,s 


2.8 


other 


128595 


U31875 


Hs.272499 


short-chain alcohol dehydrogenase family 


12.1 


TM 


128599 


NMJJ15366 


Hs.102336 


Rho GTPase activating protein 8 


2.4 


? 


128604 


AI879099 


Hs.102397 


GIOT-3 for gonadotropin inducible transc 


1.3 


other 


128608 


BE267994 


Hs.102419 


zinc finger protein 


72 


other 


128625 


AB037841 


Hs.102652 


hypotheticai protein ASH1 


1.3 


other 


128629 


AL096748 


Hs.102708 


DKFZP434A043 protein 


Z2 


other 


128639 


AW582962 


Hs.102897 


CGI-47 protein 

coatomer protein complex, subunit epsilo 


2 


TM 


128656 


AA458542 


Hs.10326 


1.4 


other 


128658 


BE397354 


Hs.324830 


diptheria toxin resistance protein requi 


15 


other 


128670 


AA975486 


Hs.103441 


Homo sapiens, Similar to RIKEN cDNA 1700 


7.1 


? 


128691 


W27939 


Hs.103834 


hypothetical protein MGC5576 


7.8 


? 


128696 


BE081143 


Hs.225977 


nuclear receptor coacth/ator 3 


3.8 


other 


128700 


Y15221 


Hs.103982 


small inducible cytokine subfamily B (Cy 


1.6 


other 


128714 


T85231 


Hs.179661 


tubulin, beta 5 


7.8 


other 
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128717 AK001564 


Hs.104222 


hypothetical protein FU10702 


5.5 


other 


128733 BE147740 


Hs. 104558 


ESTs, Moderately similar to I38022 hypot 


2.7 


TM 


128737 AF292100 


Hs.104613 


RP42homotog 


2.8 


TM 


128742 AA307211 


Hs.251531 


proteasome (prosome, macropaln) subunrt, 


4.5 


? 


128746 A1470163 


Hs.323342 


actin related protein 20 complex, subun 


2.2 


other 


128747 AB027249 


Hs,104741 


PDZ-Wnding kinase; T-cell originated pr 


2Z 


other 


128772 BE302796 


Hs.105097 


thymidine kinase 1, soluble 


5.4 


other 


128781 N71826 


Hs.105465 


small nuclear ribonucleoproteln potypept 


53.9 


TM 


128797 NJVL002975 


Hs. 105927 


stem cell growth factor; lymphocyte seer 


13.3 


other 


128806 AW630942 


Hs.106061 


RDRNA-blndlrg protein 


Z6 


other 


128814 AW248431 


Hs.256526 


nuclear prelamin A recognition factor 


22 


other 


128830 BE281170 


Hs.106357 


valosdvcontalning protein 


6 


other 


128835 AK001731 


Hs.106390 


Homo sapiens mRNA; cDNA OKFZp586H0924 (f 


1.6 


SS, 


128854 BE159181 


Hs.168232 


hypothetical protein FU13855 


2.3 


other 


128871 AF189723 


Hs.106778 


ATPase, Ca-H- transporting, type 2d memo 


1.5 


? 


128906 R57988 


Hs.10706 


epithelial protein tost in neoplasm beta 


4.8 


other 


128920 AA622037 


Hs.166468 


programmed cell death 5 


1.4 


other 


128925 R67419 


Hs.21851 


Homo sapiens cDNA FU12900 fis. clone NT 


19 


other 


128946 Y13153 


Hs.107318 


kynurenine 3-monooxygenase (kynurenine 3 


7.3 


? 


128949 AA009647 


Hs.8850 


a disintegrin and metalioprotelnase doma 


2.5 . 


other 


128959 AI580127 


Hs.107381 


hypothetical protein FU1 1200 


1.3 


other 


128965 AW150697 


Hs.107418 


ESTs 


1.4 


? 


128970 AI375672 


Hs.165028 


ESTs 


1.3 


other 


128975 BE560779 


Hs.284233 


NICE^S protein 


14 


other 


128979 AW271217 


Hs.281434 


Homo sapiens cONA RJ 14028 fls, done HE 


1.6 


TM 


128995 AI816224 


Hs.107747 


DKFZP566C243 protein 


1.9 


other 


129019 AI950087 




gbwq05c02j(1 NCLCGAPJ<id12 Homo sapien 


2.9 


other 


129021 AL044675 


Hs. 173081 


KIAA0530 protein 


3.8 


other 


129032 R80Q88 


Hs.108104 


ubtquitm-conjugatlng enzyme E2L 3 


3.4 


other 


129076 AW296806 


Hs.326234 


ESTs, Highly similar to T46422 hypotheti 


5 


other 


129078 AI351010 


Hs.102267 


lysosomal 


2.1 


other 


129088 AA744610 


Hs.194431 


paBacQn 


17.1 


other 


129095 L12350 


Hs.108623 


thrombospondln 2 


2.7 


other 


129096 AA463189 


Hs.288906 


WW DomaiivContaining Gene 


20.9 


TM 


129097 BE243933 


Hs.108642 


zinc finger.protetn 22 (KOX 15) 


3 




129099 AF146074 


Hs.108660 


ATP-blnding cassette, sub-family C (CFTR 


5.8 


TM 


129136 W93048 


Hs.250723 


■ hypothetical protein MGC2747 


6 


. other 


129149 AA356620 


Hs.108947 


WAA0050 gene product 


6.4 


TM 


129172 AW162916 


Hs.241576 


hypothetical protein PR02577 


1.8 


TM 


129192 AA286914 


Hs.183299 


ESTs 


2.1 


? 


129194 AA150797 


Hs.109276 


latexin protein 


3.3 


SS.TM 


129198 N57532 


Hs.109315 


KIAA1415 protein 


5.9 


other 


129207 A1934365 


Hs.109439 


ostooglydn (osteoinductive factor, mime 


8.1 


other 


129228 U40714 


Hs.239307 


tyrosyURNA synthetase 


2.9 


other 


I29229 AF013758 


Hs.109643 


polyadenylate binding proteln-lnteractin 


3.3 


? 


129254 AA252468 


Hs.1098 


DKFZp434J1813 protein 


2.6 


SS.TM 


129255 AI961727 


Hs.109804 


H1 histone family, member X 


7.4 


other 


129288 W26392 


Hs.110080 


EST s, Weakly similar to S13495 pregnancy 


9.6 


other 


129296 AI051967 


Hs.110122 


ESTs 


12 


other 


I29323 AA287239 


Hs.5518 


Homo sapiens cDNA FU1131 1 fis, clone PL 


5.2 


other 


I29340 H75334 


Hs.11050 


F-box only protein 9 


4.7 


SS, 


129347 BE614192 


Hs.279869 


melanoma-associated antigen recognised b 


7.7 


TM 


I29362 U30246 


Hs.110736 


solute carrier family 12 (sodium/potass! 


6.7 


TM 


I29366 BE2203Q6 


Hs.184697 


Homo sapiens done 23785 mRNA sequence 


8.6 


SS, 


29370 A1686379 


Hs.110796 


SAR1 protein 


1.4 


TM 


I29372 NMJJ16039 


Hs.110803 


CGl-99 protein 


2 


other 


I29403 AF149785 


Hs.111126 


pituitary tumor-transforming 1 interacfl 


7.5 


other 


129404 AI267700 


Hs.317584 


ESTs 


5.1 


other 


129423 AA204686 


Hs.234149 


hypothetical protein FU20647 


10.2 


other 


129482 AA188185 


Hs.289043 


spindlin 


6.8 


other 


29513 AW843633 


Hs.306163 


hypothetical protein AL110115 


7.1 


SS, 


29515 AF255303 


Hs.112227 


menibrarte-associated nucleic add binding 


2.5 


other 


29527 AA769221 


Hs.270847 


detta-tubulin 


3.2 


other 


29559 W01296 


Hs.11360 


hypothetical protein FU14784 


7.5 


other 


I29560 AA317841 


Hs.7845 


hypothetical protein MGC2752 


6.8 


other 


129570 A1923097 


Hs.11441 


chromosome 1 open reading frame 8 


2.1 


other 


129575 F08282 


Hs.278428 


progestin induced protein 


1.6 


other 
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129587 H14718 


Hs.11506 


Human done 23589 mRNA sequence 


6.8 


other 


129588 BE408300 


Hs.301862 


postmeiotfr segregation increased 2-Bke 


14 


TM 


129591 N57423 


Hs.179898 


HSPC055 protein 


7.4 


other 


129594 AW403724 


Hs.36989 


coagulation factor VO (serum prothrombi 


9 


? 


129598 AF035537 


Hs.1 15521 


REV3 (yeast homobgHike, catalytic sub 


1.6 


other 


129628 U38945 


Hs.1 174 


cydn-dependent kinase inhibitor 2A (me 


22 


other 


129549 AD000092 


Hs.16488 


caireticufin 


3.3 


other 


129675 NM_015556 


Ha.172180 


K1AA0440 protein 


13.4 


other 


129680 U03749 




gfoHuman chromogranin A (CHGA) gene, pro 


14.1 


7 


129689 AW748482 


Hs.77873 


B7homoiog3 


2.6 


other 


129702 AB04966 


Hs.12035 


ESTs, Weakly simBar to 138022 hypofoeti 


7.5 


TM 


129720 AA156214 


Hs.12152 


APMCF1 protein 


2 


other 


129721 NM.001415 


Hs*211539 


eukaryottc translation initiation factor 


1.7 


TM 


129778 AKD01676 


Hs.12457 


hypothetical protein FU10814 


1.8 


other 


129779 AA394090 


Ha.12460 


Homo sapiens clone 23870 mRNA sequence 


55 


TM 


129800 AF052112 


Hs.12540 


lysosomal 


1.7 


? 


129806 AB023148 


Hs.173373 


KIAA0931 protein 


\2 


other 


129815 BE565817 


Hs.26498 


hypothetical protein FU21657 


3.1 


other 


129840 NM-006590 


Hs.12820 


SnRNP assembly defective 1 homolog 


1.8 


other 


129861 AL049999 


Hs.85963 


DKFZP564M182 protein 


2.3 


other 


129864 AI393237 


Hs.129914 


runt-related transcription factor 1 (acu 


1.7 


SS, 


129869 A1222069 


Hs.13015 


hypothetical protein simitar to mouse Dn 


2.8 


TM 


129945 BE514376 


Hs.165998 


PAI-1 mRNA-b'mding protein 


1.8 


other 


129953 AA412195 


Hs.13740 


ESTs 


2.5 


other 


129972 AW753185 


Hs.180628 


dynamln 1-flke 


1.8 


? 


129983 U09848 


Hs.132390 


zinc finger protein 36 (KOX 1 8) 


1.3 


other 


130010 AA301116 


Hs.142838 


nucleolar phosphopratein Nopp34 


1.6 


other 


130081 AA287325 


Hs.14713 


ESTs 


4.1 


other 


130082 S73265 


Hs.1473 


gastrirweleastog peptide 


1.9 


other 


130097 AL046962 


Hs.14845 


forkhead box 03A 


2.8 


other 


130100 AL135561 


Hs.14891 


hypothetical protein FU21047 


2.3 


other 


130111 X53002 


Hs.149846 


integrin, beta 5 


2.3 


other 


130112 AA9 16785 


Hs.180610 


splicing factor proHne^lutamine rich ( 


3 


other 


130128 L76937 


Hs.150477 


Werner syndrome 


13 


other 


130135 AA311426 


Hs.21635 


tubulin, gamma 1 


6.1 


other 


130211 NMJW3358 


Hs.23703 


ESTs, Moderately similar to CEGT_HUMAN C 


1.6 


other 


130212 D80001 


Hs.152629 


KIAA01 79 protein 


U 


other 


130236 R85367 


Hs.51957 


splicing factor, ajgWne/serine-nch 2, 


2 


other 


130241 AL035568 


Hs.153203 


MyoO family inhibitor 


3.2 


other 


130242 X79201 


Hs.153221 


synovial sarcoma, translocated toXchro 


5.4 


? 


130249 D81983 


Hs.322852 


GAS2-related on chromosome 22 


4.9 


other 


130263 NM_002497 


Hs. 153704 


MIMA (never in mitosis gene a)-related k 


1.4 


other 


130287 AA479005 


Hs.1 54036 


tumor suppressing subtransferable candid 


2.6 


other 


130310 AB011121 


Hs.154248 


. amyotrophic lateral sclerosis 2 (juvenO 


6.3 


other 


130353 Z19084 


Hs.172210 


MUF1 protein 


6.2 


other 


130356 AH27577 


Hs.155017 


nuclear receptor interacting protein 1 


2.4 


other 


130357 AJ224442 


Hs. 155020 


putative methyltransferase 


3.5 


TM 


130359 NMJJ13449 


Hs.277401 


bramodomain adjacent to zinc finger doma 


8.5 


other 


130367 AL135301 


Hs.8768 


hypofoetfcal protein FU 10849 


1.4 


other 


130372 AI077464 


Hs.5011 


RNA binding moBf protein 9 


3.3 


? 


130393 N89W 


ns. 195291 


KiAAOUOo gene proaua 


i ft 

1.0 


other 


130399 AW374106 


Hs.1 55356 


hypothetical protein MGC2840 similar to 


3.4 


other 


130407 BH385099 


Hs.334727 


hypotheflcal protein MGC3017 


2.3 


other 


130409 NMJ)01197 


Hs.155419 


B&2-lnteracting kiQer (apoptosis-induc 


2.7 


TM 


130419 AF037448 


Hs.155489 


NS1-assodated protein 1 


1.8 


other 


130441 U 83 630 


Hs.155637 


protein kinase, DMA-activated, catalytic 


2.3 


other 


130448 BE513202 


Hs.15589 


PPAR binding protein 


4 


TM 


130455 D90041 


Hs.155956 


N-aoetytoansferase 1 (arytamine N-acety 


33.6 


? 


130485 BE245851 


Hs.180779 


H2B histone family, member B 


5 


other 


130487 U49844 


Hs.77613 


ataxia telangiectasia and Rad3 related 


4.4 


other 


130498 138951 


Hs.180446 


karyopherin (importin) beta 1 


1.6 


SS.TM 


130503 BE208491 


Hs.295112 


KlAA0618gene product 


16.1 


other 


130511 132137 


Hs.1584 


cartilage oBgomenc matrix protein (pse 


6.1 


other 


130526 AW876523 


Hs.15929 


hypothetical protein FU12910 


2.1 


other 


130544 AA321238 


Hs.4310 


eukaryotte translation initiation factor 


1.5 


other 


130553 AF062649 


Hs.252587 


pituitary turnoMransJorming 1 


14.4 


? 


130556 AI907018 


Hs.15977 


Empirically selected from AFFX single pr 


4.8 


other 
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130567 


AA383092 


Hs.1608 


replication protein A3 (14kD) 


8 


other 


130568 


AA232119 


Hs.16085 


putative G-protein coupled receptor 


3.4 


other 


130574 


AF083208 


Hs.16178 


apoptosis antagonizing transcription fac 


1.2 


other 


130598 


AL042210 


Hs.16493 


hypothetical protein OKFZp762N2316; KIAA 


1.4 


other 


130601 


AA609738 


Hs.16525 


ESTa 


1.5 


TM 


130614 


AI354355 


Hs.16697 


dowiwegulator of transcription 1 , TBP-b 


1.3 


other 


130617 


M90516 


Hs.1674 


glutamine-fructose-6-phosphate transamin 


1Z1 


TM 


130618 


AA383439 


Hs.16758 


Spir-1 protein 


15.9 


other 


130667 


BE246961 


Hs.17639 


Homo sapiens ubiquitln protein ligase (U 


13.9 


other 


130674 


AL048842 


Hs.194019 


attracts 


1.5 


other 


130675 


AA442233 


Hs.17731 


hypothetical protein RJ 12892 


5.4 


other 


130692 


AA652501 


Hs.13561 


hypoftetical protein MGC4692 


5 


other 


130693 


R68537 


Hs.17962 


ESTs 


2 


other 


130712 


AJ271881 


Hjl279762 


bromodomain-containtig 7 


1.8 


TM 


130714 


AI348274 


Hs.18212 


DNA segment on chromosome X (unique) 987 


2 


TM 


130730 


AB007920 


Hs.18586 


K1AA0451 gene product 


3.8 


? 


130744 


H59696 


Hs.18747 


POP7 (processing of precursor, S. cerevi 


3.2 


? 


130751 


AF052105 


Hs.18879 


chromosome 12 open reading frame 


1.4 


other 


130757 


AL036067 


Hs.18925 


protein x 0001 


5.7 


other 


130768 


AF258627 


Hs.211562 


ATP-binding cassette, sub-family A (ABC1 


5.2 


? 


130789 


AKQ00355 


Hs.8899 


sirtuln (silent mating type information 


1.6 


other 


130836 


J05068 


Hs.2012 


transcobalamln I (vitamin B12 binding pr 


15.7 


ss, 


130841 


AL157468 


Hs.325825 


Homo sapiens cONA FU20848 fis, clone AO 


2.8 


other 


1 30843 


AA447492 


Hs.20183 


ESTs, Weakly similar to AF164793 1 prate 


1.5 


other 


130844 


U76248 


Hs.20191 


seven in absentia (Orosophita) homotog 2 


3.5 


other 


130855 


AJ243706 


Hs.143323 


putative DNA/chromatin binding motif 


1.7 


other 


130861 


NM.016578 


Hs.20509 


HBVpX associated proteM 


1.9 


other 


130879 


NMJJ03416 


Hs.2076 


zinc finger protein 7 (KOX 4, clone HF.1 


1.4 


other 


130880 


BE514434 


Hs.20830 


kinesirvlike 2 


2.1 


TM 


130892 


AL120837 


Hs.20993 


high-glucose-regulated protein 8 


2.5 


other 


130896 


AB033078 


Hs.186613 


sphingosine-1-phosphate lyase 1 


1.7 


other 


130911 


BE409769 


Hs.21189 


DnaJ (Hsp40) homolog, subfamily A, membe 


1.8 


other 


130919 


N79110 


Hs.21276 


collagen, type IV, alpha 3 (Goodpasture 


2.3 


TM 


I30944 


BE382657 


Hs.21486 


signal transducer and activator of trans 


5.4 


other 


130971 


N39842 


Hs.301444 


K1AA1673 


2.2 


SS. 


130993 


T97401 


Hs.21929 


ESTs 


1.6 


other 


31005 


AV658308 


Hs.2210 


thyroid hormone receptor infractor 3 


1.6 


? 


31028 


AI879165 


Hs.2227 


CCAAT/enhancer binding protein (C/EBP), 


U 


other 


31042 


AI826288 


Hs.171637 


hypothetical protein MGC2628 


1.6 


other 


131046 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


7.4 


? 


131060 


AA194422 


Hs.22564 


myosin VI 


5.1 


other 


31070 


N53344 


Hs.22607 


ESTs 


7.1 


other 


31076 


M749230 


Hs.26433 


doHchyl-phosphate (UDP-N-acetyiglucosam 


2,1 


TM 


31099 


AL133353 


Hs.226581 


COX15 (yeast) homolog, cytochrome c oxid 


7.1 


other 


31174 


NMJJ06540 


Hs.29131 


nuclear receptor coactivator 2 


1.9 


? 


31185 


BE280074 


Hs.23960 


cyciln B1 


5.8 


? 


31206 


AW138839 


Hs.24210 


ESTs 


2 


other 


31213 


AA885699 


Hs.24332 


CGI-26 protein 


7.1 


TM 


31225 


H62087 


Hs.31659 


thyroid hormone receptor-associated pro! 


7.6 


? 


31231 


N47468 


Hs.59757 


zinc finger protein 281 


2.9 


other 


31233 


089053 


Hs.266012 


fatty-add-Coenzyme A ligase, long-chain 


3.5 


other 


31243 


AW383256 


Hs.24752 


spectrin SH3 domain binding protein 1 


2.8 


? 


31245 


AL080080 


Hs.24766 


thioredoxin domain-containing 


2.8 


SS.TM 


31247 


AL043100 


Hs.326190 


fatty acid amide hydrolase 


5.6 


other 


31281 


AA251716 


Hs.25227 


ESTs 


5.8 


other 


31283 


X80038 


Hs.339713 


Homo sapiens clone F1 9374 APO E-C2 gene 


1.3 


other 


31305 


AV656017 


Hs.184325 


CGl-76 protein 


5 


? 


31320 


AA505691 


Hs.145696 


splicing factor (CC1 .3) 


1.8 


TM 


31339 


AF058696 


H^25812 


Nijmegen breakage syndrome 1 (nibrin) 


2.6 


other 


31375 


AW293165 


Hs.143134 


ESTs 


5.4 


other 


31390 


BE269388 


Hs.182698 


mitochondrial ribosomal protein 120 


5,3 


other 


31410 


BE259110 


Hs.279836 


HSPC166 protein 


2.2 


other 


31412 


NM.012247 


Hs.124027 


SELENOPHOSPHATE SYNTHETASE ; Human selen 


2 


31429 


AL046302 


Hs,26750 


hypothefical protein FU21908 


1.4 


other 


31458 


BE297567 


Hs.27047 


hypothetical protein PLJ20392 


1.7 


other 


31475 


AA992841 


Hs.27263 


KIAA1458 protein 


2 


other 


31501 


AV661958 


Hs.8207 


GK001 protein 


2.6 


other 
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131511 AA732153 


Hs.27865 


Homo sapiens cONA: FU21333 fis, clone C 


2 


other 


131528 AU076408 


Hs.28309 


UDP-^lucose dehydrogenase 


1.6 


TM 


131532 BE268278 


Hs.28393 


hypothetical protein MGC2592 


7.4 


other 


131543 AW966881 


Hs.41639 


programmed cell death 2 


22 


other 


131544 AL355715 


Hs.28555 


programmed cell death 9 (PDC09) 


2.1 


other 


131562 NMJ03512 


Hs.28777 


H2A histone family, member L 


1.7 


other 


131564 T93500 


Hs.28792 


Homo sapiens cDNAFU 11041 fis, done PL 


5.2 


other 


131569 AL389951 


Hs.271623 


nudeoponn 50kD 


5 


other 


131618 BE393822 


Hs.29645 


Homo sapiens mRNA; cONA DKFZp7G 1 C029 (fr 


1.8 


other 


131622 R78195 


Hs.29692 


Homo sapiens cONA RJ 11436 fis, clone HE 


1.3 


other 


131623 AB037791 


Hs.29716 


hypothetical protein FU 10980 


2.2 


TM 


131643 AW410601 


Hs.30026 


HSPC182 protein 


3 


other 


131653 AW960597 


Hs.30164 * 


ESTs 


1.3 


other 


131656 AI218918 


Hs.30209 


WAA0854 protein 


2.8 


other 


131669 X52486 


Hs.3041 


uracU-ONA glycosylase 2 


2.8 


other 


131692 BE559681 


Hs.30736 


KIAA0124 protein 


5.6 


? 


131714 AA642831 


Hs.31016 


putative DNA binding protein 


2.9 


? 


131722 D13757 


Hs.311 


phosphorbosyl pyrophosphate amidotransf 


3.4 


other 


131737 AK001641 


Hs.31323 


inhibitor of kappa light polypeptide gen 


3,9 


? 


131763 A1878932 


Hs.317 


topoisomerase (DNA) l 


3.4 


other 


131772 AA382590 


Hs.170980 


KIAA0948 protein 


255 


other 


131787 D87077 


Hs.196275 


K1AA0240 protein 


2.4 


SS, 


131793 AW966127 


Hs.32246 


Homo sapiens cDNA FU 14656 fis, done NT 


8 


TM 


131795 BE501849 


Hs.32317 


hfgh-nwbSity group 20B 


15 


other 


131798 X86098 


Hs.301449 


adenovirus 5 E1A binding protein 


42 


other 


131817 U20536 


Hs.3280 


caspase 6, apoptosls-related cysteine pr 


4.3 


other 


131824 U28838 


Hs.32935 


TATA box binding protein (TBP^associate 


3.5 


other 


131850 AI251317 


Hs.33104 


ESTs 


5.2 


TM 


131878 AA083764 


Hs.6101 


hypoffieticai protein M6C3178 


5.9 


other 


131885 BE502341 


Hs.3402 


ESTs 


13.7 


other 


131900 AA099014 


Hs.231029 


Homo sapiens, done MGC:1 5961 , mRNA, com 


8.7 


other 


131904 AF078866 


Hs.284296 


Homo sapiens cDNA: FU22993 fis, done K 


5.5 


other 


131905 AA179298 


Hs.3439 


stomaWke 2 


113 


other 


131913 AW207440 


Hs.185973 


degenerative spermatocyte (homolog Oroso 


1.7 


SS, 


131916 AA025976 


Hs.34569 


ESTs 


5.2 


TM 


131925 AF151048 


Hs.183180 
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5.6 


TM 


134834 


AW451370 


Hs.8991 


adaptor-related protein complex 1, gamma 


5.3 


other 


134850 


AI701162 


Hs.90207 


hypoiheticat protein MGC11138 


9.1 


other 


134853 


BE268326 


Hs.90280 


5-aminoimldazote4-carboxamide ribonucle 


Z4 


other 


134880 


AI879195 


Hs.90606 


15 kDa selenoprotein 


2.7 


other 


134925 


AW885909 


Hs.6975 


PRO1073 protein 


15 


other 


134955 


AW401361 


Hs.91773 


protein phosphatase 2 (formerly 2A), cat 


4.9 


other 


134971 


AI097346 


Hs.286049 


phosphoserine aminotransferase ■ 


2 


other 


134975 


R50333 


Hs.92186 


Leman coiled-coB protein 


2.6 


TM 


135011 


AB037835 


Hs.92991 


KIAA1414 protein 


1.4 


? 


135022 


NM_000408 


Hs.93201 


glyceroW-phosphate dehydrogenase 2 (ml 


1.6 


? 


135032 


AW301984 


Hs.173685 


hypothetical protein FU 1261 9 


1.4 


other 


135077 


AW503733 


Hs.9414 


KIAA1488 protein 


1.8 


other 


135083 


AB036063 


Hs.94262 


p53-lnducible ribonucleotide reductase s 


2.5 


other 


135095 


AF027219 


Hs.9443 


zinc finger protein 202 


1.5 


TM 


135096 


AA081258 


Hs.132390 


zinc finger protein 36 (KOX 18) 


2.1 


other 


135153 


AI093155 


Hs.95420 


JM27 protein 


4.4 


? 


135181 


BE250865 


Hs.279529 


px1 9-fike protein 


14.9 


? 


135199 


AA477514 


Hs.96247 


translln-associated factor X 


U 


other 
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135207 N26427 


Hs.9634 


ESTs, Highly similar to C10JWMAN PUTATl 


1.7 


other 


135214 T788Q2 


Hs.96560 


hypothetical protein RJ11656 


6.2 


other 


135243 BE463721 


Hs.97101 


putative G protein-coupled receptor 


2.8 


TM 


135245 AI028767 


Hs.262603 


ESTs 


12.2 


TM 


135257 AW291023 


Hs.97255 


ESTs, Weakly similar to A46010 X-finked 


7.7 


TM 


135263 A1088775 


Hs.55498 


geranytgeranyl diphosphate synthase 1 


1.8 


other 


135274 AA448460 


Hs.112017 


GE36 gene 


4.2 


SS, 


135294 AA150320 


Hs.9800 


protein kinase Njmu-R1 


\2 


other 


135295 A1090838 


Hs.98006 


ESTs 


4.9 


other 


135307 AI743770 


Hs.98368 


ESTs, Weakly similar to K1AA0822 protein 


5.9 


? 


135321 AI652069 


Hs.98614 


rtbosome binding protein 1 (dog 180kD ho 


12.3 


TM 


135354 AA456454 


Hs.183418 


ceil 'division cycle 2-ffice 1 (PITSLRE pr 


5.8 


? 


135361 AA373452 


Hs.167700 


Homo sapiens cOMA FU10174 fis, done HE 


8.1 


other 


135389 U05237 


Hs.99872 


fetal Alzheimer antigen 


1.9 


other 


135400 X78592 


Hs.99915 


androgen receptor (dhydrotestosterone r 


13.9 


TM 


302256 M857131 


Hs.171595 


HIV TAT specific factor! 


1.6 


other 


302276 AW057736 


Hs.323910 


HER2 receptor tyrosine kinase (c-en>b2, 


5.3 


other 


303135 AW592789 


Hs.279474 


HSPC070 protein 


2.2 


TM 


303686 AK000714 


Hs.109441 


MSTP033 protein 


1.4 


SS, 


310085 R43191 


Hs.101248 


Homo sapiens done IMAGE32553, mRNA seq 


5.2 


other 


315516 AA808229 


Hs.167771 


ESTs 


2.3 


? 


317781 NMJ07057 


Hs.42650 


ZW10 interactor 


2.9 


? 


320836 AI268997 


Hs.197289 


rab3 GTPase-acth/ating protein, non-cata 


2 


other 


321114 AA902256 


Hs.78979 


Golgl apparatus protein 1 


5.6 


SS. 


322221 N24236 


Hs.179662 


nucleosome assembly protein 1-fike 1 


1.4 


? 


322474 AF118083 


Hs.29494 


PR01912 protein 


1-3 


other 


322556 BE041451 


Hs.177507 


hypothetical protein 


2.9 


SS. 


323541 AF292100 


Hs.104613 


RP42 homotog 


1.6 


other 


407827 BE278431 


Hs.40323 


BUB3 (budding uninhibited by benzimidazo 


1.8 


other 


408196 AL034548 


Hs.43627 


SRY (sex determining region Y)-box 22 


1.6 


other 


408813 AI580090 


Hs.48295 


RNA heiicase family 1 


6.2 


other 


409176 R73727 


Hs.101617 


ESTs, Weakly similar to T32527 hypotheU 


5.7 


other 


409487 H19886 




gfcyn57a05.r1 Soares adult brain N2b5HB5 


2.7 


other 


413670 AB000115 


Hs.75470 


hypothetical protein, expressed in osteo 


2.6 


? 


414108 A1267592 


Hs.75761 


SFRS protein kinase 1 


2.4 


TM 


414846 AW304454 


Hs.77495 


UBX dornaln-contalrilng 1 


Z4 


other 


416040 AW819158 


Hs.289044 


Homo sapiens cDNA FU12048 fis, done 


2.3 


other 


416980 AA381133 


Hs.80684 


higrwnobiKy group (nonhistone chromoso 


4.2 


TM 


417378 R57256 


Hs.82037 


TATA box binding protein (TBP}-associate 


23.6 


other 


418283 S79895 


Hs.83942 


calhepsin K (pycnodysostosis) 


5.8 


other 


418467 NM-006910 


Hs.85273 


retinoblastoma-blndtng protein 6 


1.3 


other 


420269 U72937 


Hs.96264 


alpha thalassemia/mentaJ retardation syn 


1.6 


? 


420802 U22376 


Hs.1334 


v-myb avian myeloblastosis viral oncogen 


2.3 


? 


421225 AA463798 


Hs.102696 


MCT-1 protein 


1.6 


? 


421642 AF172066 


Hs.106346 


retinoic acid repressible protein 


3.5 


other 


421828 AW891965 


Hs.279789 


histone deacetyiase 3 


5 


other 


421983 AI252640 


Hs.1 10364 


peptidyiprolyt isomerase C (cydophQin 


3.1 


TM 


422052 AA302744 


Hs.104518 


ESTs 


1.9 


TM 


422055 NMJM4320 


Hs.1 11029 


putative terru^btnding protein 


2.4 


other 


423750 AF165883 


Hs.298229 


prefoldin 2 


4.2 


? 


424001 W67883 


Hs.137476 


paternally expressed 10 (PEG10; K1AA105 


7.1 


? 


425182 AF041259 


Hs.155040 


zinc finger protein 217 


2.3 


other 


425284 AF155568 


Hs.155489 


NS1 -associated protein 1 


3.5 


other 


426372 BE304680 


Hs.169531 


DEAD/H (Asp-<3kj-Ala-Asp/Hls) box poiypep 


1.9 


? 


428049 AW183765 


Hs.182238 


GW128 protein 


7.6 


? 


428477 AW500533 


Hs.1 1482 


splicing factor, ajgininefeerine-rich 11 


1.7 


other 


437562 AB001636 


Hs.5683 


DEAD/H (Asp-Gb-Ala-Asp/His) box poiypep 


2.4 


other 


438449 AK001333 


Hs,6216 


Homo sapiens hepatocellular carctnoma-as 


3.8 


other 


441560 F13386 


Hs.7888 


Homo sapiens clone 23736 mRNA sequence 


5.6 


other 


445580 AF167572 


Hs.12912 


skbl (S. pombe) homotog 


2 


TM 


446999 AA151520 


Hs.334822 


hypothetical protein MGC4485 


7.6 


other 


447111 AI017574 


Hs.17409 


cystelne-rich protein 1 (intestinal) 


2.2 


other 


447778 BE620592 


Hs.71190 


ESTs, Weakly similar to S 16506 hypotheii 


2.9 


other 


448873 NM_003677 


Hs.22393 


density-regulated protein 


1.8 


other 


449667 W68520 


Hs.331328 


intermediate filament protein syncoflin 


5.9 


other 


450701 H39960 


Hs.288467 


Homo sapiens cONA FU 12280 fis, done MA 


5.7 


other 


450703 AA011202 


Hs.184771 


nudear factor 1/C (CCAAT-binding transc 


1.4 


other 
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452461 N78223 Hs.108106 transcription factor 

452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 lis, clone AD 

453157 AF077036 Hs.31989 DKFZP586G1722 protein 

453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp 

5 100685 AA328229 Hs.184582 ribosomal protein L24 

100690 AA383256 Hs.1657 estrogen receptor 1 

100833 AF135168 Hs.108802 N^ylrrialeimide-sensitive factor 

100850 AA836472 Hs.297939 cathepstnB 

101161 NMJM6262 Hs.37044 peripherin 

10 102481 U50360 gb:Human calcium, calmoduGn-deperKient p 

102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 

103549 BE270465 Hs.78793 protein kinase C, zeta 

103749 AL135301 Hs.8768 hypothetical protein FU10849 

104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein 

15 104532 AI498763 Hs.203013 hypothetical protein RJ 12748 

104563 AU 17403 Hs.306189 DKFZP434F1735 protein 

105032 AA127818 gb:z!12a02.s1 Soaresj>regrranLUterus_NbH 

105039 AA907305 Hs.36475 ESTs 

106531 AA454036 Hs.8832 ESTs 

20 106977 AL043152 Hs.50421 K1AA0203 gene product 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL H 

108717 AA122393 Hs.70811 hypofoefical protein FU20516 

110018 AW579842 Hs.104557 hypothetical protein FU10697 

110330 AI288666 Hs.16621 DKFZP434I1 16 protein 

25 111391 NMJJ03896 Hs.225939 slajyttransferase 9 (CMP-NeuActactosylc 

111392 W46342 Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA, 

113554 AW503990 Hs.142442 HP1-BP74 

113722 AV653556 Hs.184411 albumin 

115008 AKD01827 Hs.87889 helkase-mol 

30 115062 AA253314 Hs.154103 LIM protein (similar to rat protein kina 

115121 AI634549 Hs.88155 ESTs 

117881 AF161470 Hs.260622 butyrate-induced transcript 1 

119075 M10905 Hs.287820 Cbronectln 1 

119615 AL034423 Hs.75875 ubiquffirnconjugatlng enzyme E2 variant 

35 120253 AA131376 Hs.326401 fibroblast growth factor 12B 

125006 BE065136 Hs.145696 splicing tactor(CC1. 3) 

127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 

128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, com 

40 128959 AI580127 Hs.107381 hypothetical protein FU1 1200 

129209 R62676 Hs.17820 Rho-associated, coiled-coO containing p 

129449 AI096988 Hs.111554 ADP-ribosyiation factor-like 7 

129453 AW974265 Hs.111632 Lsm3 protein 

129629 AK000398 Hs.11747 hypothetical protein FU20391 

45 129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 

129922 AF042379 Hs.13386 gamma-tubutin complex protein 2 

129989 AB015856 Hs.247433 activating transcription factor 6 

130182 BE267033 Hs.192853 ubiquitln-conjugating enzyme E2G 2 (homo 

130365 W56119 Hs.155103 eukaryotic translation ciitiation factor 

50 130471 AL121438 Hs.183706 adducin 1 (alpha) 

130542 U64675 Hs.179825 RAN binding protein 2-like 1 

130586 AB007891 Hs.16349 KIAA0431 protein 

130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1 

130992 BE398091 HsJ4316 desmopiakln (DPI, DPII) 

55 131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-tin 

131135 NMJJ16569 Hs.267182 TBX3-iso protein 

131339 AF058696 Hs.25812 N|megen breakage syndrome 1 (nibrin) 

131760 X76732 Hs.3164 nucleobindin 2 

131774 BE267158 Hs.169474 DKFZP586J01 19 protein 

60 131853 AJ681917 Hs.3321 ESTs, Highly similar to IRXLHUMAN JROGU 

131881 AW361018 Hs,3383 upstream regulatory element binding prot 

131887 W17064 Hs.332848 SWi/SNF related, matrix associated, act! 

132031 AF193844 Hs.3758 COP9 complex subunit 7a 

132192 AA206153 Hs.4209 mitochondrtal ribosomal protein L37 

65 132203 NA/L004782 Hs.194714 synaptosoma^associated protein, 29kD 

132240 AB018324 Hs.42676 K1AA0781 protein 

132348 AW067708 Hs.170311 heterogeneous nuclear ribonucteoprotein 
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other 
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SSJM 


4.8 


other 


U 


TM 


1.6 


other 
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other 


1.7 
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16.9 


other 


3.2 


other 
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? 
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other 


1.8 


other 


2 
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2.1 


other 


1.2 


other 


7 


? 


2.6 


7 


1.6 


other 


4.9 


other 


2J5 


TM 


1.3 


other 


5.3 


TM 


6.3 


other 


5.1 


SS, 


8.4 


other 


3.7 


TM 


U 


other 


2 


other 


1.5 


other 


2.8 


other 


5.8 


TM 


5.7 


other 


1.3 


other 


38.9 


other 


2.9 


? 


1.8 


other 


3 


other 


13.3 


other 


10.9 


other 


2.4 


other 


8.2 


TM 


3.3 


? 


3.9 


other 


5.3 


TM 


4.6 


other 


4 


SS. 


4.6 


other 


11 


other 


2.7 


other 


7.9 


other 


5.6 


TM 


5.2 


other 


1.8 


TM 


1.7 


? 


3.3 


TM 


2.6 


other 


2.9 


TM 


5.6 


other 


13 


other 


3.2 


TM 
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other 
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TM 
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other 
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132528 
132571 
132726 
132883 
133016 
133053 
133197 
133240 
133266 
133285 
133383 
133540 
133784 
133791 
133850 
133859 
133881 
134208 
134403 
134724 
134806 
134859 
135193 



T78736 

AW674699 

N52298 

BE26Sm 

AI439688 

AI065016 

AI275243 

AK001489 

AI160873 

M76477 

BE313555 

AL037159 

BE622743 

M34338 

W29092 

U86782 

U30872 

NMJJ0Q288 

AA334551 

AF045239 

AD001528 

D26488 

X95525 

AA243007 

T70541 

X57766 

S66431 

AA453483 



Hs.50758 
Hs.5169 
Hs.55608 
HS.2364W 



M173417 
AA280588 
AA504223 



F02907 

AA480103 

AA024664 

AA251776 

AA399047 

N34059 

U95367 

AA490899 

T54762 

Z41963 

AA521186 

AA400195 

AA045083 

AA099589 

W85712 

W45728 

U61232 

AA425154 

T39176 

AA496000 

W38150 

T96595 

AA227463 

R46025 

AA233177 

AA338760 

AA412106 

L47276 

D82307 

AA293558 

R37778 

AA250843 

W49521 

080000 



Hs.6390 

Hs.180201 

Hs.242894 

Hs.69233 

Hs.289082 

Hs.7252 

Hs74619 

Hs.301064 

HS76244 

Hs7678 

Hs.178761 

Hs.77204 

Hs.79993 

Hs^2767 

Hs.321576 

Hs.89718 

Hs.90315 

Hs.96103 



SMC4 {structural maintenance of chramoso 7.4 

suppressor of G2 allele of SKP1 , S. cere 6.9 

hypothetical protein MGC955 14 J 

RAB10, member RAS oncogene family 10.3 

hypothetical protein RJ20886 4.4 

Homo sapiens clone FLB3344 PRO0845 mRNA, 1.8 



hypoftettaal protein FU 20671 
ADP-ribosylafion factor-like 1 
zinc finger protein 
GM2 gartgOoskte activator protein 
WAA1224 protein 

proteasome (prosome, macropatn) 26S subu 



1-8 
1.8 
16.1 
10.4 
1.5 
1.7 
12.1 
9.7 
4.2 
22 
9.1 
3.2 
1.4 
1.4 
2.6 
13.3 
3.1 
1.6 
2.5 
4.5 
3.1 
4.6 
1.4 
1.9 
2.2 
2.4 



spermidine synthase 
cellular retinoic add-bindlng protein 1 
26S prateasome-assodated pad1 Iwmoiog 
centromere protein F (350/400kD, mitosin 
peroxisomal biogenesis factor 7 
sperm specific antigen 2 
ring ringer protein 22 
spermine synthase 
K1AA0007 protein 

TATA box binding protein (TBP)-associate 
ESTs 
ESTs 

Human stromerysln-3 mRNA 
Homo sapiens done 23592 mRNA sequence 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs Highly similar to CHROMOSOME 
ESTs Highly similar to Surf-4 protein (M.mu$culus] 5,5 
ESTs £3 
ESTs Weakly simflar to HI! ALU SUBFAMILY J 2.8 
Human NADH:ubiquinone oxidoredudase suburut 6.2 
ESTs 2.3 
ESTs 2.4 
EST-RC.N34059 3.3 
Human GABA-A receptor pi subunit mRNA complete cds 
ESTs 3.3 
ESTs 2.9 
Homo sapiens HP protein (HP) mRNA complete cds 1.3 
ESTs 1.6 
ESTs 1.3 
VITAMIN INDEPENDENT GAMMA-CARBOXYLASE 
Homo sapiens mRNA for GDP dissociation Inhibitor beta 
ESTs Weakly similar to PROCOLLAGEN ALPHA 2(IV 
ESTs Highly similarto HETEROGENEOUS 3.7 
Human tubulin-folding cofactor E mRNA complete cds 
ESTs 5.3 
ESTs Weakly similarto 2K1058.4 fCetegans] ZB 
ESTs 1.9 
EST-RC.W38150 17 
EST-RCJT96595 1.8 
ESTs Weakly similarto No definition Bne found [C.eiegans] 
ESTs 2.8 
ESTs 2 
ESTs 1.3 
ESTs 65 
EST-L47276 3.4 
ESTs Weakly simflar to TH1 protein [D.meianogaster] 
ESTs 15 
ESTs 2.4 
Interferon regulatory factor 5 14.6 
Human prolyl 44tydraxytase alpha (10 subunit 65 
Human mRNA for WAA0178 gene partial cds 2 . 
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R99978 


ESTs Weakly similar to fine-1 protein ORF2 (H^apfensj 


6.1 


AA1 95036 


Human Ro/SSA ribonudeoproteln homolog (RoRet 5.3 


? 


Z38501 


ESTs Weakly similar to PROBABLE E5 


1.4 


other 


U37547 


Huimn IAP homolog B (MJHB) mRNA complete cds 3.2 


other 


AA479961 


ESTs 


1.7 


other 


X57579 


Inhlbin beta A (activin A activin AB alpha polypeptide) 


15.B 


AA449071 


ESTs 


1.3 


TM 


N51855 


ESTs Moderately slmiiarto NAD(4)ADP- 


1.3 


other 


AA421213 


ESTs Weakly similar to F28F8.3 [Cetegans] 


3.2 


other 


AA355201 


ESTs 


1.2 


SS.TM 


N/oflf 


H^apiens mRNA for \rans]in 


1.5 


? 


N73808 


ESTs 


5 


? 


U86782 


Human 26S proteasome-assoclated pad1 


2.2 


other 


AA234817 


ESTs 


13 


other 


013666 


Homo sapiens mRNA for osteoblast specific 


7.5 


SS, 


AA236177 


ESTs 


7.1 


? 


U50648 


Protein kinase interferon-Inducible double 


4.1 


? 


M28211 


Homo sapiens GTP-binding protein (RAB4) 


2.9 


other 


M446949 


ESTs 


2.2 


other 


W03007 


ESTs 


1.2 


other 


W61011 


ESTs 


n 


other 


W87544 


ESTs 


1.2 


other 


X02751 


Neuroblastoma RAS viral (v-ras) oncogene homolog 1.2 


? 


Z14077 


YY1 transcription factor 


12 


other 


Z38839 


ESTs 


u 


? 


AA410894 


ESTs 


1.7 


other 


AA504499 


EST s Highly similar to probable chloride channel 3 JHsap 


1.3 
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TABLE 7A 



5 

10 
15 
20 



Table 7 A shows the accession numbers for those pkeys lacking unigenelD's for Table 7. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

102481 3128L-28 U50360 

105032 genbanU^127818 AA127818 

409487 1134778.1 H19886 AW402806T10231 



v 
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TABLE 8: Figure 8 from BRCA 001-1 US 

5 Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 

Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 
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Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD; Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 

R2: Ratio of tumor to normal breast tissue 

Pkey ExAccn UnigenelD Unigene Title 



100075 
100229 
100262 
100271 
100355 
100522 
100552 
100599 
100676 
100690 
100895 
101046 
101086 
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101201 
101212 
101441 
101447 
101469 
101567 
101600 
101624 
101674 
101861 
101977 
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103546 



AF152333 

AV652249 

O38500 

BE160Q81 

AI907114 

X51501 

AA019521 

X77343 

X02761 

AA383256 

U01351 

K01160 

AA382524 

NMJJ02923 

NMJ06262 

122524 

AJ186220 

AW468397 

M21305 

AA310162 

M33552 

BE561617 

M55998 

NM_002291 

AA350659 

AF112213 

AL036335 

AA334592 

AF015224 

Nr\L003480 

NM.001394 

U96759 

AI379954 

BE244588 

R50032 

AW293542 

X63629 



D38616 

X83492 

BE536700 

T34708 

Y00815 

Z14244 



Hs.284160 
Hs.180107 

Hs.256290 

Hs.71465 

Hs.99949 

Hs.301946 

Hs.334334 

Hs.287820 

Hs.1657 

Hs.75772 

Hs.250959 

Hs.78944 

Hs.37044 

Hs.2256 

Hs.83164 

Hs.100000 

Hs.169248 
Hs.56729 
Hs.1 19192 

Hs.82124 

Hs,83347 

Hs.184062 

Hs^13 

Hs.79914 

Hs.46452 

Hs.300946 

Hs.2359 

Hs.198307 

Hs.79025 

Hs.6456 

Hs.159263 

Hs.75309 

Hs.2877 

Hs.79227 

Hs.54941 

Hs.82359 

Hs.4888 

Hs.272927 

H&75216 

Hs75752 



protocadherin gamma subfamily B, 4 
polymerase (DNA directed), beta 
postmelotic segregation increased 2-fflce 
S100 calcium-binding protein A11 (calgiz 



prolactin-induced protein 



R1 
1 

I. 7 
0.8 
3.2 
3,3 

II. 9 
3.8 
9.4 
3 

4.4 
1 



transcription factor AP-2 alpha (activat 
fibronedin 1 
estrogen receptor 1 

nuclear receptor subfamily 3, group C, m 
NMJX)2122:Hamo sapiens ma)orhistocompat1.7 

histatin 1 0.8 

regulator of G-proteln signalling 2, 24k 1 2 

peripherin 3.1 

matrix metaibproteinase 7 (MMP7; uterin 4.4 

collagen, type XV, alpha 1 3.1 

S100 calcium-binding protein A8 (calgran 0.9 

gb:Human alpha satellite and satellite 3 29.9 

cytochrome c 0.8 

lysosomal 1 

H2A histone family, member Z 2.8 

gb^iuman atpha-1 collagen type t gene, 3 3.1 

laminin, beta 1 1.5 

anglo-associated, migratory ceil protein 3.1 

putative Rab5~interacting protein 1 .3 

secreted phosphoprotain 1 (osteopontin, 1.9 

lumlcan 22 

marnmaglobtn 1 4.2 

Microfibril-assoclatedgtycoprotein-2 11 

dual specificity phosphatase 4 4.5 

von HippeHJndau binding protein 1 1.4 

KIAA0096 protein 0.9 

chaperonin containing TCP1 , subunit 2 (b 1 .5 

collagen, type VI, alpha 2 2.2 

eukaryotlc translation elongation factor 5.6 

cadherin 3, type 1 , P-cadherin (placenta 3.7 

myomestn (M-protein) 2 (165kD) 1.3 

phosphoryiase kinase, alpha 2 (liver) 1 .3 

tumor necrosis factor receptor superfami 0.8 

senyMRNA synthetase 0.9 

Sec23 (S. cerevlsiae) homolog A 1.1 

protein tyrosine phosphatase, receptor t 3,7 

cytochrome c oxidase subunit Vllb 0,9 
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R2 

3.8 
5.3 
4.8 
2.3 
1.4 
0.4 
1.2 
9.4 
7.8 
4.4 
3.9 
4 

4.1 
12 
1.1 
0.6 
3.4 
4.2 
0.3 
4.9 
5.9 
4 

1.7 

4.1 

1.4 

6.9 

4.9 

3.8 

07 

4.2 

0.5 

42 

3.9 

10.9 

6.2 

5.7 

0.5 

4 
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8 
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U 
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103656 


NNL000088 


Hs.172928 collagen, type 1, alpha 1 


3.2 


3 


103758 


AA084874 


gb2n13e04,r1 Stratagene hNT neuron (937 


0.9 


10 


103774 


H24185 


Hs.92918 hypothetical protein 


1.9 


15.9 


103821 


M095971 


Hs.198793 Homo sapiens cONA: HJ22463 fe, clone H 


\2 


3.9 


103869 


BE439604 


Hs.24322 ATPase, H+ transporting, lysosomal (vacu 


1.4 


3.9 


103980 


AW130242 


Hs.293476 hypothetical protein FKSG44 


1.6 


4.1 


104054 


AK001913 


Hs.7100 hypothetical protein 


1.5 


4.3 


104115 


AF183810 


Hs.26102 opposite strand to trichorhinophalangeal 


7 


7 


104189 


AB040927 


Hs.301804 WAA1494 protein 


2 


4.6 


104230 


AB002347 


Hs.15303 K1AA0349 protein 


07 


4.5 


104278 


AW583693 


Hs.109253 N-termlnal acetyttransferase complex ard 


3.3 


3.3 


104295 


AW365522 


Hs.103657 hypothefical protein PR02219 


2.3 


4.2 


104319 


AW804296 


Hs.9950 Sec61 gamma 


3.1 . 


7 


104425 


AF283775 


Hs.35380 x 001 protein 


4 


1.3 


104432 


X51501 


Hs.99949 prolacfirHnduced protein 


3.8 


0.6 


104464 


AW966728 


Hs.54642 methionine adenosyltransferase II. beta 


0.8 


6.7 


104479 


AKD01731 


Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (f 


1.7 


104505 


W94824 


Hs.11565 RIKEN cDNA 2010100012 gene 


2 


7.5 


104592 


AW630488 


Hs.325820 protease, serine, 23 


1.9 


7.4 


104613 


AF123303 


Hs.24713 hypothetical protein 


1.1 


63 


104636 


R82252 


Hs.106106 protein kinase (cAMP-dependent, cataryii 


ii 


4 


104782 


AW270555 


Hs.171774 hypothefical protein 


1.4 


3.9 


104792 


AA960961 


Hs.305953 zinc finger protein 83 (HPF1 ) 


1.5 


42 


104848 


AA305351 


Hs.274369 uncharacterized hypothalamus protein HAR 


1.1 


4.1 


104849 


A1279065 


Hs.241 507 ribosoma! protein S6 


1.3 


4.6 


104850 


AL133035 


Hs.8728 hypotfieOcal protein DKFZp434G171 


1.2 


3.6 


104852 


W70164 


Hs.20107 ESTs 


0.8" 


42 


104861 


AA058630 


Hs.29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 


1.7 


104873 


W03831 


Hs.20597 host ceB factor homobg 


0.8 


5.4 


104891 


W44626 


Hs.30627 ESTs 


0.7 


6.6 


104920 


AW955089 


Hs.306083 Novel human gene mapping to chomosome 22 


1 3.9 


104926 


BE298808 


Hs.33363 DKFZP434N093 protein 


3.3 


3.3 


104952 


AW076098 


Hs.74316 desmopfeWn (DP), DP1I) 


12 


3.7 


104963 


AB029020 


Hs.173694 KIAA1097 protein 


1.1 


5.5 


104977 


AI392640 


Hs.18272 amino add transporter system A1 


3.2 


1.4 


105030 


BE613061 


Hs.337772 Homo sapiens, Similar to RIKEN cDNA 0610 1.6 


11.4 


105035 


N39760 


Hs.8859 Homo sapiens, Similar to RIKEN cDNA 5830 1.5 


7.2 


105068 


BE410438 


Hs.9006 VAMP (vesicle-associated membrane protei 


1.1 


3.5 


105159 


AF146277 


Hs.265561 a>2-assodated protein 


1.2 


10 


105178 


AA313825 


Hs.21941 AD036 protein 


3.6 


8,3 


105182 


BE407961 


Hs.18271 golgi phosphoprotein 3 


1.7 


6.8 


105274 


AI554929 


Hs.281866 ATPase, H+ transporting, iysosomal (vacu 


1.1 


3.7 


105303 


BE243327 


Hs.182626 chromosome 22 open reading firame 5 


1.5 


4 


105413 


AI015709 


Hs.172089 Homo sapiens mRNA; cDNA DKF2p58612022 (f 


1.5 


105426 


W20027 


Hs.23439 ESTs 


4.3 


2.9 


105432 


W03516 


Hs.76698 stress-associated endoplasmic reticulum 


1.5 


5 


105443 


AA252372 


Hs.12144 WAA1033 protein 


1.2 


3.6 


105483 


AL137257 


Hs.23458 Homo sapiens cONA: FU23015 fis, clone L 


1.7 


15.8 


105492 


A1805717 


Hs.289112 CGM3 protein 


2 


4.8 


105495 


AL037715 


Hs.28785 microfibrillar-associated protein 3 


1.3 


3.9 


105539 


AB040884 , 


Hs.109694 KIAA1451 protein 


2.7 


11.4 


105594 


AB024334 


Hs.25001 tyrosine 3-monooxygefiase/byptophan 5-mo 


1.3 


6.1 


105623 


BE504200 


Hs.30127 hypothefical protein 


1.7 


4.5 


105807 


M788946 


Hs.16869 ESTs, Moderately similar to CA1C RAT COL 


3.9 


24,6 


105812 


BE614149 


Hs.20814 CGI-27 protein 


1.8 


3.6 


105823 


A1559444 


Hs.293960 ESTs 


1.9 


6.6 


105831 


AA329449 


Hs.247302 twisted gastrutation 


1.5 


4.3 


105851 


AI827976 


Hs.24391 hypolheticalprotehaJ13612 


3.8 


1.9 


105879 


6E392914 


Hs.30503 Homo sapiens cDNA FU 1 1344 fis, done PL 


1.7 


4 


105918 


AW028485 


Hs.26136 hypothetical protehMGC14156 


1.7 


7.4 


105939 


AL137728 


Hs.12258 Homo sapiens mRNA; cONA DKFZp434B0920 (f 


1.2 


105941 


AB033075 


Hs. 10569 development and differentiation enhancin 


1J3 


4.6 


105969 


AB030656 


Hs.17377 coronh, acfin-binding protein, 1C 


1.1 


5.9 


105990 


A1690586 


Hs.29403 hypothetical protein FU220S0 


2 


4.6 


106012 


A1240665 


Hs.8895 ESTs 


4.1 


M 


106060 


NM.001329 


Ks.171391 C-termlnal binding protein 2 


2.6 


7 


106070 


T74445 


Hs.5957 Homo sapiens clone 24416 mRNA sequence 1.4 


10.7 



4.8 



5.1 



14 



3.8 
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106414 
106538 
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106574 
106613 
106617 
105619 
106701 
106721 
106776 
106866 
106868 
106887 
106940 
106968 
107052 
107061 
107149 
107222 
107233 
107295 
107679 
107914 
107965 
108033 
108060 
108081 
108137 
108186 
108215 
108297 
108339 
108371 
108399 
108469 
108470 
108564 
108641 



H62087 

AA425414 

BE613206 

BE568205 

AK000274 

AW051564 

6E044325 



108694 
108824 
108863 
108893 
108992 
109072 
109097 
109160 
109244 
109481 
109484 
109795 
110009 
110107 
110411 
110731 
110756 
110930 
110935 
111051 
111110 
111356 
111357 
111770 
111900 



H09548 

AA459480 

BE387614 

AA741038 

AA206079 

AA487416 

BE185536 

BE503373 

T85594 

AF216751 

BE391904 

BE147611 

AI289507 

BE172058 

BE267795 

AA186629 

AA011510 

AA027229 

AF109219 

AW368993 

AA291440 

AA093668 

AI283611 

AW068579 

AI879238 

AA333660 

AW151340 

M074374 

AF086070 

AA079487 

AA079500 

M23114 

AA1 12059 

AA058522 

AA036725 

AK001332 

AA133456 

BE276891 

AA152312 

AI732585 

AA167512 

BE220601 

BE179030 

AA878923 

AA366263 

AA173942 

BE075297 

AW151660 

AW001579 

NM_014899 

N212Q7 

BE242691 

AI753230 

A1681293 

AK001566 

BE301871 

BE314949 

R27975 

AF131784 



Hs.31659 

Hs.33287 

Hs.279607 

Hs.28827 

Hs.278635 

Hs.28285 

Hs.227280 

Hs.30212 

Hs.5367 

Hs.23956 

Hs25797 

Hs.6670 



Hs.268231 

Hs.301183 

Hs.334335 

Hs.339808 

Hs.26813 

Hs.12482 

Hs.6354 

Hs.299883 

Hs.82689 

Hs.22595 

Hs.80120 

Hs.60512 

Hs.61329 

Hs.108787 

Hs.323748 

Hs.73149 

Hs.28578 

Hs.263479 

Hs.7780 

Hs.299315 

Hs.71331 

Hs.51615 

Hs.67639 

Hs.237519 



Hs.1526 

Hs.429 

Hs.185751 

Hs.61847 

Hs.44672 

Hs.102543 

Hs.194691 

Hs.72047 

Hs.22394 

Hs.301997 

Hs.64239 

Hs.289069 

Hs.72531 

Hs.326416 

Hs.6614 

Hs.31444 

Hs.9645 

Hs.188006 

Hs.182999 

Hs.14947 

Hs.323562 

Hs.12186 

Hs.23618 

Hs.4867 

Hs.87128 

Hs.269401 

Hs.25318 



thyroid hormone receptor-associated prot 1 ,5 

nuclear factor l/B 5.4 

catpastatin 1.8 

mitog en-activated protein kinase kinase 5.1 

HDCMA18P protein 1.2 

patched related protein translocated in 1 .8 

U6 snRNA-assoclated Sm-flke protein 2.3 

thyroid receptor interacting protein 15 1 .2 

ESTs. Weakly similar to I38022 hypotheti 0.9 

hypothetical protein FU2Q502 1.3 

splicing factor 3b, subunit 4, 49kD 1.6 

ESTs 1.7 

hypothetical protein RJ20420 1 

Homo sapiens cDNA: FU231 1 1 fls, clone L 1.6 

molecule possessing ankyrin repeats Indu 3.3 

hypothetical protein FU13576 1.4 

hypothetical protein FU10120 3.3 

CDA14 3 

glyceronephosphate O-acyttransferase 17 

stromal cell derived tactor receptor 1 1 2 

hypothetical protein FU23399 1.8 

tumor rejection antigen (gp96) 1 1 2 

hypothetical protein FU10637 1.4 

UDP-N-acetyt-alpha-D-galactosamlneipoIyp 2.6 

ESTs 1.8 

ESTs, Weakly similar to T16370 hypotheti 1.3 

phosphatidyfinositol glycan, class N 1 .6 
Homo sapiens clone CDABP0086 mRNA sequen 

paired box gene 8 1.1 

muscleblind {DrosophilaHIke 0.7 
ESTs, Weakly similar to HMG1JIUMAN HIGH 
Homo sapiens mRNA; cDNA DKFZp564A072 (fr 

coHapsin response mediator protein-5; C 1 .5 

hypothetical protein MGC5350 1.5 
ESTs, Weakly similar to ALU7_HUMAN ALU S 

ESTs 1.3 

EST 1 

gbzm97f08.s1 Stratagene colon HT29 {937 1 .5 
gbzm96h10.s1 Stratagene colon HT29 (937 
ATPase, Ca++ transporting, cardiac musd 
ATP synthase, H+ transporting, mltochond 
ESTs 
ESTs 

hypothetical protein FLJ10470 
glucocorticoid receptor DNA binding fact 
retinolcacld induced 3 
ESTs 

hypothetical protein FU 10893 
gbzp10f12s1 Stratagene fetal retina 93 
hypothetical protein FU 13033 
Human DNA sequence from clone RP5-1174N9 

hypothetical protein FU21016 3.8 

hypothetical protein FU11838 1.9 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 

ESTs, Weakly similar to A43932 mucin 2 p 4.6 
ESTs 

Homo sapiens mRNA for KIAA1741 protein, 
KIAA0878 protein 
ESTs 
ESTs 

hypothetical protein DKFZp564K142 
hypothetical protein FU22558 
hypothetical protein FU10704 
mannosyl (afcha-1,3-)-glycoproteIn beta- 
hypometlcal protein FU23309 
ESTs, Moderately similar to S65657 alpha 



1.1 
2 

1.1 
\2 
1.4 
1.4 
1.2 
U 
1.1 
1.2 
1.3 
4 



1.2 
3.7 
2.8 
1.6 
3.1 
1.9 
2 

1.1 
1 

3.3 
1.2 



Homo sapiens clone 25194 mRNA sequence 3.2 



3.6 
1.2 
4 

6.1 

5.9 

5.4 

115 

3.6 

4.4 

3.6 

7.3 

6.1 

5.4 

5.4 

1.2 

6.3 

1.8 

3 

7.6 
4.3 
6.5 
6.9 
3.5 
4.3 
4 

3.5 
3.5 
1.8 
3.5 
5.6 
1.2 
3.1 
4.6 
4 

6.3 
3.8 
3.6 
3.6 
4.3 
4.9 
3.5 
3.6 
3.6 
3.5 
4 

3.6 
4.1 
3.5 
5 

6.1 
1.7 
7.7 
4 

3.7 
7.4 
3.5 
3.3 
3.7 
3.5 
1.2 
7.5 
4 

3.8 
8.2 
6.1 
5.4 
0.8 



8.1 



5.6 
6.9 



4.7 



7.4 
1.3 
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111903 NMJM906 Hs.166351 KIAA1072 protein 1 5.4 

111951 NNL014927 Hs.100527 KIAA0902 protein 1 3.8 

112141 AW137198 Hs.278682 PhosphatWylglycerophosphate Synthase 14 3.5 

.112193 R49499 Hs.138238 ESTs 1.5 3.6 

5 112197 NM.003655 Hs.5637 ESTs 4.6 2 

112610 AW500106 Hs.23643 serme/threonine proteh kinase MASK 3.3 10.5 

112971 Z42387 Hs.83883 transmembrane, prostate androgen Induced 3.2 3 

112984 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.7 10.8 

113056 AF019226 Hs.8036 glioblastoma overexpressed 4.5 3.7 

10 113449 AW160683 Hs.158006 hypothetical protein M 4.4 

113497 AF143321 Hs.15572 hypoihetical protein IMAGE 109914 0.9 3.6 

113508 A1D42936 Hs.211571 holccytochrorne c synthase (cytochrome c 1.1 3.5 

113531 AK001898 Hs.16740 hypothetical protein RJ1 1036 1.2 3.9 

113604 AI075407 Hs.296083 ESTs, Moderately similar to 154374 gene 1.7 5.3 

15 113674 NMJH4214 Hs.5753 irwsitot(myoH{or4HTK)nophosphatase2 0.8 6.1 

113841 W30681 Hs.146233 Homo sapiens cONA: FU22130 fis, done H 1.7 6.2 

113857 AW243158 Hs.5297 OKFZP564A2416 protein . 1.2 4.6 

113931 BE255499 Hs.3496 hypothetical protein MGC1 5749 1.5 4 

113936 W17056 Hs.83623 nuclear receptor subfamily 1, group I, m 3.8 1 

20 113987 AA345519 Hs.9641 complement component 1, q subcomponent, 1.2 4.7 

114132 A1342493 Hs.24192 Homo sapiens cONA FU20767 fis, clone CO 0.3 4.3 

114156 6E179882 Hs.336920 glutathione peroxidase 3 (plasma) 1.1 4.3 

114213 N58309 Hs.19575 CG1-11 protein 1.6 9.2 

114636 AA075488 gb:zm88d01.s1 Stratagene ovarian cancer ' 1.6 3.7 

25 114760 AI929382 Hs.252692 hypothetical protein FU20343 1.4 4 

114781 T10446 Hs.95388 ESTs 1 4.3 

114795 AB037858 Hs.173484 hypothetical protein RJ10337 1.6 9.2 

114901 AV660012 Hs.196437 hypothetical protein FU10788 1.4 5.2 

115096 AI683069 Hs.175319 ESTs 3.7 1 

30 115518 BE541042 Hs.23240 Homo sapiens cDNA: FU21848 fis, done H 3.2 4,2 

115646 N36110 Hs.305971 solute carrier family 2 (facilitated glu 1.5 3.9 

115764 AW582256 Hs.91011 anteriorgradlent2{Xenepuslaevls)hom 1.3 5.9 

115802 AW410233 Hs.206521 YME1 (S.cerevlsiaeHike 1 1.7 6,6 

115994 AB037836 Hs.109315 K1AA1415 protein 1.5 9.1 

35 116032 BE383668 Hs.42484 hypothetical protein FU10618 0.9 4.3 

116046 BE395293 Hs.94491 hypothetical protefo HJ20297 1.6 5.5 

116274 A1129767 Hs.1 82874 guanine nucleotide binding protein (G pr 3.2 2.4 

116310 Z24854 Hs.42299 ESTs 0.8 4.7 

116356 AI371223 Hs.288671 Homo sapiens cONA FU11997 fis, done HE 2.4 3.9 

40 116429 AF191018 Hs.279923 putative nucleotide binding protein, est 5.5 5.5 

116461 AA313607 Hs.58633 Homo sapiens cONA: FU22145 fis, done H 5 1.3 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 8.7 4.5 

116578 D21262 Hs.75337 nucleolar and coiled^oV phosphprotein 3.2 6.9 

116579 AW886411 Hs.81915 leukemia-associated phosphoproteln p18 ( 3.2 3 
45 116589 AI557212 Hs.17132 ESTs. Moderately similar to 154374 gene 3.1 8.3 

116786 H25836 Hs.301527 ESTs, Moderately similar to unknown [H.s 3.2 4.5 

117170 N25929 Hs42500 ADP-ribosylation factor-like 5 7 5.5 

117215 N20066 Hs.133207 PTPRF interacting protein, binding prote 1.2 6.2 

117280 M18217 Hs.172129 Homo sapiens cDNA:FU21409 fis, done C 4.5 2.4 

50 < 117576 A1383467 Hs44597 ESTs 1.4 4.2 

117667 U59305 Hs.44708 Ser-Thr protein kinase related to the my 4.3 0.5 

117881 AF161470 Hs.260622 butyrate-tnduced transcript 1 2.1 5.7 

118336 BE327311 H$47166 HT021 3.6 7.7 

118475 N66845 gb:za46c11.s1 Soares fetal fiver spleen 4.2 0.5 

55 118493 AL353944 Hs.50115 Homo sapiens mRNA;cDNAOKFZp761J11 12 (f 3.5. 

118505 N67343 gb:yz5Qb07.s1 Morton Fetal Cochlea Homo 2.1 3.8 

119159 AF142419 Hs.15020 homolog of mouse quaking OKI (KH domain 3.7 1.5 

119307 BE048061 Hs.37054 ephrfcWtf 3 1.1 

119355 BE218319 Hs.5807 GTPaseRab14 1.1 5.6 

60 119462 BE041667 Hs.314544 Homo sapiens cervical cancer suppressor- 1.4 4.3 

119771 AI905687 Hs.2533 EST 3.2 1 

119940 AL050097 Hs.272531 DKFZP586B0319 protein- 4.3 0.7 

119943 BE565849 Hs.14158 copinelU 3.5 1,9 

120407 AA235207 Hs.250456 hypothetical protein DKFZp762F2011 1.5 3.7 

65 120493 AW968080 Hs.152939 Homo sapiens clone 24630 mRNA sequence 4 1.4 

125677 AF150208 Hs.108327 damage-specific ONA binding protein 1 (1 1.6 6.8 

120867 AA350781 Hs.96967 ESTs 1.1 3.6 
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121358 
121603 
121723 
122223 
122378 
122946 
123155 
123158 
123327 
123495 
123526 
123533 
123768 
123961 
123999 
124000 
124038 
124059 
124083 
124148 
124153 
124252 
124314 
124375 
124432 
124447 
124539 
124543 
124564 
124574 
124605 
124639 
124659 
124737 
124760 
124763 
124792 
124842 
124940 
124949 
124960 
124995 
125030 
125034 
125058 
125076 
125090 
125103 
125154 
125155 
125159 
125170 
125181 
125193 
125260 
125262 
125272 
125388 
125824 
125852 
125970 
126192 
126469 
126510 
127095 
127524 
128312 



BE262956 

AA416785 

AA243499 

AF169797 

AB032948 

AI718702 

AF121856 

AF161426 

AA421581 

W28673 

AA608657 

AA608751 

AI932318 

AL050184 

AF084555 

BE563957 

AB037860 

BE387335 

AW195237 

BE300094 

AU077333 

BE613340 

AK001552 

087454 

N39016 

N48000 

D54120 

AI393320 



AL036596 

AA749315 

H60193 

AI680737 

BE270465 

AW408586 

BE410405 

R44357 

R56485 

AF068846 

AI903210 

AL023513 

T52700 

AA610577 

BE548446 

T83731 

AA973971 

T91518 

AA570056 

W38419 

AA837043 

AK000669 

AL020996 

R40815 

W67577 

H05635 

AW884980 

BE612888 

W27235 

Z45258 

AW630088 

AW504721 

AW160399 

BE384361 

AA057593 

AA340277 

AI243596 

J04182 



Hs.178292 

Hs.249495 

Hs.104800 

Hs.27413 

Hs.21356 

Hs.308026 

Hs.284291 

Hs.218329 

Hs.178443 

Hs.106747 



Hs.188762 

Hs21610 

Hs.7351 

Hs.74861 

Hs.173933 

Hs.283713 

Hs.7734 

Hs.227751 

Hs.160483 

Hs.334725 

Hs.215766 

Hs.192966 



K1AA01 80 protein 

heterogeneous nuclear ribonucieoprotein 
hypothetical protein FU10134 
adaptor protein containing pH domain, PT 
hypothetical protein DKFZp762K2015 



sorting nexin 6 
hypothetical protein 
ESTs 

serine carboxypepfidase 1 precursor prot 
gb:ae55dQ4.s1 Stratagene lung carcinoma 
gbae56h07.s1 Stratagene lung carcinoma 



1.5 
2.2 
3.4 
3.9 
1.4 
1.4 
U 
2.4 
0,9 
1.3 
2.1 
2.1 



Hs.146409 

Hs.104573 

Hs.108275 

Hs.42322 

Hs.77171 

Hs.21143 

Hs.289068 

Hs.78793 

Hs.91052 

Hs.76288 

Hs.48712 

Hs.103804 

Hs.336780 

Hs.194766 

Hs.110044 

Hs.187775 

Hs.5167 

Hs.3343 



Hs.122730 

Hs.143669 

Hs274428 

Hs.8518 

Hs.12396 

Hs.84298 

Hs.294030 

Hs.171957 

Hs.180224 

Hs.64311 

Hs.286013 

HsJ6550 

Hs.177516 

Hs.30376 

Hs.182885 

Hs.334762 

Hs.10248 

Hs.94830 

Hs.150101 



ESTs, Moderately similar to H2BLHUMAN H 1.1 

DKFZP434B203 protein 1.1 

cyclic AMP phosphoprotein, 19 kD 1.4 

activated RNA polymerase II transcriptio 1 .9 

nudear factor l/A 1.5 

ESTs, Weakly similar to S64054 hypotheti 14.8 

hypothetical protein FU22174 1.2 

lectin, galactoside-blnding, soluble, 1 2.5 

erythrocyte membrane protein band 7.2 (s 1 

Homo sapiens, Similar to RIKEN cDNA 9430 1.5 

GTP-binding protein 1.8 

KIAA0265 protein 1.1 

ESTs, Weakly similar to ALUC_HUMAN 111! 1.3 

gb:yy98e12.s1 SoaresjnuIt]pIe_sclerosts_ 2.7 

cell division cycle 42 (GTP-binding prot 2.1 

ESTs 1 

ESTs 1.4 

A kinase (PRKA) anchor protein 2 0.7 

minichromosome maintenance deficient (S. 1.1 

OKFZP586C1324 protein 1.4 

Homo sapiens cDNA FU1 1918 fis, clone HE 1.5 

protein kinase C, zeta 0.7 
ESTs. Moderately similar to ALUS JIUMAN A 0,9 

calpain 2, (m/II) large subunlt 1.3 

hypothetical protein FU20736 1.8 

gb:yg93h09.s1 Soares infant brain 1N1BH 1 

heterogeneous nuclear ribonucieoprotein 3.2 

tubulin, beta polypeptide 1 

seizure related gene 6 (mouselike 0.9 

ESTs 0.9 

ESTs 1.2 
Homo sapiens mRNA; cONA DKFZp434F152 (fir 

phosphoglycerate dehydrogenase 0.9 
gb»q02h08.s1 NCLCGAP_Lu5 Homo sapiens 

gbye20f05.s1 Stratagene lung (937210) H 32 
ESTs, Moderately sbnilar to WAA1215 pro 
gb2c78a07.s1 Pancreatic Islet Homo sapl 
ESTs 

TRF2-interacting telomencRAPI protein 



N 



5.3 
0.9 
1.1 
1.1 
1.1 
1 

1.2 
1 

1.3 
1.1 
1.4 
2.4 



ESTs, Weakly similar to 2004399A chromos 
C074 antigen (invariant polypeptide of m 
topolsomerase-related function protein 4 
triple functional domain (PTPRF Interact 
myosin regulatory light chain 
a dislntegrin and metalloproteinase doma 
short cofled-coil protein 
Homo sapiens mRNA; cDNA DKFZp564B1264 (f 

high density lipoprotein binding protein 1.9 

hypothetical protein 1.4 

ESTs, Weakly similar to JC5024 UDP-galac 2 

hypothetical protein RJ14735 1.3 

Homo sapiens CDNA FU20167 lis, clone CO 1.3 

ESTs, Moderately similar to T03094 A-kln 4.3 

lysosomal 1.5 
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4.1 

5.5 

3.2 

3.9 

7.1 

3.7 

4.9 

3.6 

5.2 

5.1 

5.2 

9.3 

3.6 

3.5 

3.8 

11.2 

4.4 

11.5 

62 

12.7 

4.1 

8.4 

102 

4.8 

4.1 

4.3 

5.7 

4.1 

4 

4 

3.5 
3.6 
9.9 
4 

3.6 
3.9 
4.2 
3.6 
3.4 
4.4 
5.2 
3.5 
5 

« 
6 

1 3.7 

25 

6.6 

6.1 

4.3 

4.1 

3.8 

3.6 

7.8 

4.9 

4.8 

16.1 

5.3 

8.7 

18 

3.8 

4.1 

3.7 

4.1 

5 

0.9 
4.7 
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128453 X02761 
128460 T16206 
128491 H08379 
128495 NMJ05904 
128546 NMJKJ3478 
128574 AI185977 
128611 NMJJ14721 

128652 AA432202 

128653 087432 
128655 A1246669 
128684 BE246444 
128717 AKD01564 
128774 AA476220 
128790 AF026692 
128805 AA194554 
128827 A1638184 
128840 AI917602 
128869 AA768242 

128889 D60985 

128890 AI222020 
128915 AK000140 
128920 AA622037 
128926 AF155Q96 
128930 AA298958 
128942 AW247536 
128948 AW953622 
128953 AB020716 

128979 AW271217 

128980 AA258924 
129005 AI770025 
129009 C15105 
129013 AA371156 
129068 AI634522 
129106 AW504486 
129113 BE543205 

129125 AB002450 

129126 AW881089 
129151 N23018 
129230 AA335362 
129234 M18916 

129238 8E542214 

129239 W57656 
129241 AI878857 
129243 BE169531 
129247 R49920 
129250 AA344367 
129258 AA250970 
129260 AF077200 
129270 AA357185 
129277 AB007896 
129284 AA318224 
129300 W94197 
129318 AF189062 
129352 AW511656 
129362 U30246 
129379 BE278964 
129390 AA318271 
129416 AA016188 
129427 A1498631 
129470 W92931 
129472 AL050260 
129475 NJWL004477 
129498 AA449789 
129501 AI631811 
129527 AA769221 
129545 R18087 
129579 AW517695 



Hs.287820 

Hs.237164 

Hs.165563 

Hs.100602 

Hs.101299 

Hs.38260 

Hs.102471 

Hs,103147 

Hs.10315 

Hs.324275 

Hs.283685 

Hs.104222 

Hs.54457 

Hs.105700 

Hs.183434 

Hs.106334 

Hs.106440 

Hs.80618 

Hs.106909 

Hs.182364 

Hs.107139 

Hs.166468 

Hs.107213 

Hs.10724 

Hs.10729 

Hs^23025 

Hs.107362 

Hs.281434 

Hs.10758 

Hs.13323 

Hs330716 

Hs.107942 

Hs.152925 

Hs.108689 

Hs-288771 

Hs^78391 

Hs.108806 

•Hs.171391 

Hs.109646 

Hs.282997 

Hs.109697 

Hs.109701 

Hs.109708 

Hs.109727 

Hs.109733 

Hs.109760 

Hs^51946 

Hs.279813 

Hs.109918 

Hs.110 

Hs.296141 

Hs.1 10165 

Hs.285976 

Hs.170177 

Hs.1 10736 

Hs.11085 

Hs.250905 

Hs.111244 

Hs.111334 

Hs^50899 

Hs.323817 

H^203772 

Hs.75511 

Hs.180403 

Hs.270847 

Hs.323769 

Hs.286218 



1 \2 
ESTs, Highly similar to LDHH_HUMAN UAC3.1 
hypothetical protein DKFZp434N1429 0.6 
MAD (mofoers against decapentaptegic, Or 1 .3 
cuKin 5 1 
ubiquitin specific protease 18 0.8 
K1AA0680 gene product 1.3 
hypothetical protein FU21347 1.4 
solute carrier family 7 (cationlc amino 1.2 
WW domain-containing protein 1 0.8 
hypothetical protein FU 20396 3 
hypothetical protein RJ10702 2.8 
C081 antigen (target of antiproGferafiv 1.1 
secreted frizzled -re la ted protein 4 1 
ATPase, H+ transporting, lysosomal (vacu 5.3 
Homo sapiens clone 23836 mRNA sequence 2.2 
ESTs 1 
hypothetical protein 0.8 
DKFZP566D193 protein 4.6 
CocoaCrtsp 3 
hypothetical protein 0.2 
programmed cell death 5 15 
hypothetical protein HJ20585 4 
MOS023 protein 1.2 
hypothetical protein 1.4 
RAB31 , member RAS oncogene family 2.3 
KIAA0909 protein 0.9 
Homo sapiens cDNA FU 14028 fis, clone HE 1.5 
NNL002495*:Homo sapiens NADH dehydrogena 
hypothetical protein FU22059 1.2 
Homo sapiens cDNA FU 14368 fis, clone HE 2.1 
DKFZP564M1 12 protein 2.4 
KIAA1268 protein ■ 1,2 
sterol regulatory element binding transc 1 .2 
DKFZP586A0522 protein 0.5 
CGM09 protein 1 
Homo sapiens mRNA; cDNA DKFZp566M0947 (f 



C4erminal binding protein 2 2.1 

EmpiricaBy selected from AFFX single pr 0.9 

glucosidase, beta; acid (includes giucos 1.1 

ESTs 11 

ubiquftin-Gke 5 3.2 

hematological and neurological expressed 1 .9 

TAK1-binding protein 2; KIAA0733 protein 1 2 

CGI-131 protein 1.5 

Empirically selected from multiple AFFX 1 

pofy(A)-bindIng protein, cytoplasmic 1-1 1.3 

hypothetical protein 1.6 

ras homolog gene family, member H 1 .8 

putath/e L-type neutral amino acid trans 1 . 1 

ESTs 2.5 

ribosomal protein L26 homolog 1 .6 

tumor metastasis-suppressor 1 .8 

Metel (mouse) homolog 0.9 

solute carrier family 12 (sodium/potass! 1.4 

CGI-1 11 protein 1 

hypothetical protein 1 

hypothetical protein 1.8 

ferriSn, fight polypeptide 1.1 

heat shock factor binding protein 1 1.8 

DKFZP547E1010 protein 1 

FSHD region gene 1 1.1 

connective tissue growth factor 1 .9 

STRIN protein 1.1 

delta-tubuOn. 1.1 

dspiatin resistance related protein CRR 1 

junctional adhesion molecule 1 2.3 
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4.3 
44.4 
13.1 
4 

5.1 
4 

3,7 

3.9 

3.6 

4.1 

1,6 

4.8 

10.6 

3.8 

5.3 

5.3 

4.5 

3.6 

3.7 

1.5 

3.9 

15.2 

4 

4.5 
5 

5.6 

3.9 

3.6 

0.8 

57 

9.9 

3.8 

3.8 

5.5 

3.7 

5.2 

15 

9.7 

8.6 

3.5 

128 

5.1 

5.7 

6.6 

3.5 

5.4 

4.1 

3.9 

4.2 

6.1 

4.8 

5.1 

6.5 

4 

9.2 

4.8 

4.1 

10.7 

4.8 

9.3 

5 

4.2 
6.8 
97 
4.3 
4.2 
3.5 
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129606 AW968941 Hs.166254 hypofoetical protein DKFZp566l1 33 2.4 4.4 

129619 AA209534 Hs.284243 tetraspari NET-6 protein 3.2 13 

129620 D79338 Hs.239720 CCR4-N0T transcription complex, subunit 1.6 4.6 

129621 AL110212 Hs.301005 purlne-rich element binding protein B 1.1 5.7 
5 129634 AB020335 Hs. 1 81300 seH {suppressor of lin-12, C.elegans}- 0.9 4.3 

129663 A1207406 Hs.11866 translocase of inner mitochondrial membr 1.9 4.8 

129679 AW889132 Hs.1 1916 ribokinase 0.9 4.1 

129688 U53209 Hs.24937 transformer^ alpha (htra-2 alpha) 1.3 4.7 

129691 M26939 Hs.1 19571 collagen, type III, alpha 1 (Ehlers-Oanl 4.7 3.7 

10 129712 U46386 Hs. 12102 sorfagnexlh3 U 3.6 

129747 AL050272 Hs.12305 DKFZP566B183 protein 1 8.9 

129788 BE397454 Hs.124969 Homo sapiens clone 24707 mRNA sequence 1.4 3.6 

129796 BE218319 Hs.5807 GTPaseRab14 29 5.1 

129797 M62839 Hs.1252 apolfpoprotein H (beta-2-glycoproteIn I) 0.3 5.1 
15 129800 AF052112 Hs.12540 lysosomal 1.6 8.8 

129834 AL080084 Hs.296155 CGMOO protein 0.9 5.3 

129836 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.8 9.9 

129843 NMJ514840 Hs.200598 KIAA0537 gene product 0.9 3.6 

129874 AA626937 Hs.181551 hypothetical protein MGC2594 1.4 9.5 

20 129878 Z43161 Hs.283714 30 kDa protein 1.1 6.3 

129904 AL1 19499 Hs.13285 neuronal potassium channel alpha subunit 1 3.5 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 28), reg 2 5.1 

129976 X14008 Hs.234734 lysosomal 0.9 4.9 

129982 Z14221 gb:H.sapiens germfine transcript of Ig h 1.2 3.6 

25 130007 R15917 Hs.142570 Homo sapiens clone 24629 mRNA sequence 4.3 1.3 

130060 BE277024 Hs.146381 RNA binding motif protein, X chromosome 1.6 3.8 

130064 X57815.comp Empirically selected from AFFX single pr 1,2 8.2 

130068 M93143 Hs.262869 plasminogen* 1.4 7.9 

130090 H97878 Hs. 132390 zinc finger protein 36 (KOX 18) 1.4 12.3 

30 130095 AK001635 Hs.14838 hypothetical protein FU 10773 0.2 4.6 

130102 W61005 Hs.14896 DHHC1 protein 1 ■ 4.1 

130112 AA916785 Hs. 180610 splicing factor proline/glutamine rich ( 1.2 5,3 

130115 T47294 Hs, 1 49923 X-box binding protein 1 3.8 0.8 

130123 NM.005095 Hs.150390 zinc finger protein 262 1 4.2 

35 130150 BE094848 Hs.15113 rKjrrragentisate1,2^bxygenase(hornogenti 0.5 4 

130161 R42678 Hs.1 51 385 WAA0564 protein 1 3.7 

130210 M23115 Hs.1526 ATPase, Ca++ transporting, cardiac musd 0.4 4.4 

130213 BE278370 Hs.15265 heterogeneous nuclear rlbonucleoprotein 1.7 7.5 

130215 BE301883 Hs.152707 glioblastoma amplified sequence 1 5.6 

40 130232 U29463 gfcHuman cytochrome b561 gen 1.2 4.2 

130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transposo 1.3 3.6 

130281 W78907 Hs.15395 sirrflartoarginyi-tRNA synthetase (argi 1.5 4.4 

130343 AB040914 Hs.278628 K1AA1481 protein 2.9 7.5 

130385 AW067800 Hs.155223 stanniocatdn 2 3.2 0.2 

45 130414 AW842182 Hs.241392 small Inducible cytokine A5 (RANTES) 1.4 10.6 

130417 AW163518 Hs.1 55485 huntingtin Interacfing protein 2 1.7 11.7 

130440 AA852868 Hs.1 32853 WAAQ171 gene product 1.1 5 

130442 NMJXJ6245 Hs.1 18244 protein phosphatase 2, regulatory subuni 1.4 4.3 

130465 AW362955 Hs.15641 Homo sapiens cONA FLJ14415 fis, clone HE 1.6 7.6 

50 130479 R44163 Hs.12457 hypothetical protein FLJ10814 0.9 4.1 

130499 AB007915 Hs.158286 KIAA0446 gene product 1 3,8 

130546 AI598022 Hs.193989 TAR DNA binding protein 1.3 4.7 

130568 AA232119 Hs.16085 putafive G-proteln coupled receptor 1.2 9.4 

130606 AI652143 Hs.288382 hypothetical protein FU131 11 1 4.1 

55 130612 BE242873 Hs.16677 WD repeat domain 15 1.1 3.6 

130616 AL049963 Hs.284205 up-regulated by BCG-CWS 0.6 3.8 

130623 AL045128 Hs.1691 glucan(1,4-alpha-). branching enzyme 1 0.9 6.6 

130629 AL042896 Hs.1697 ATPase, H+ transporting, lysosomal (vacu 0.9 3.9 

130632 AW073971 Hs.238954 ESTs,WeaWy similar to KIAA1204 protein 0.9 6.9 

60 130639 AI557212 Hs.17132 ESTs, Moderately similar to 154374 gene 2.6 3.9 

130641 AF158555 Hs.239189 glutaminase 1.2 13.8 

130653 AI861791 Hs.278479 TSPY-like 1.3 4 

130655 AI831962 Hs.17409 cysteine-rich protein 1 (intestinal) 2.5 4 

130666 AL117508 Hs.194035 KIAA0737 gene product 1.3 6.2 

65 130669 AI928985 Hs.17680 hypothetical protein MGC1314 similar to 1.4 3.9 

130693 R68537 Hs.17962 ESTs 3.2 0.8 

130694 NMJ014827 Hs,17969 KIAA0663 gene product 1.1 4.8 
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130696 AA325308 Hs.18016 Homo sapiens mRNA; cONA DKFZp586H0324 (f 1.8 5 

130701 Z98883 Hs.18079 phosphatldyllnositol glycan, dass Q 1.1 67 

130707 AW190925 Hs.203559 hypothetical protein FU12701 1.2 4.1 

130731 AJ932971 Hs.18593 Homo sapiens cONA: RJ21449 fis, clone C 14 6.9 

5 130787 AF072813 Hs.252831 refcuton3 1.2 112 

130796 AA088809 Hs.19525 hypofoetical protein RJ22794 1.8 6.8 

130808 NWL001761 Hs.1973 cycfinF 1.3 4.1 

130863 Y 10805 Hs.20521 HMT1 (hnRNP memyltransferase, S. cerevl 3.2 5.9 

130902 AB037750 Hs.21061 WAA1329 protein 1 .3.8 

10 130908 AW195747 Hs^1122 hypothetical protein FU1 1830 similar to 1.3 7.9 

130911 BE409769 Hs.21189 OnaJ (Hsp40) homolog, subfamily A, membe 27 3.7 

130913 BE390905 Hs.21198 transtocase of outer mitochondriaJ membr 1.9 4 

130923 H96115 Hs^1293 UDP-N-acteylgfucosamine pyrophosphoryias 1.9 10.3 

130959 AB023182 Hs.184523 WAA0965 protein 1.5 6.8 

15 130967 AA393071 Hs.182579 leucine arrdnopeptidase 1.4 5.5 

130975 AA099923 Hs.283728 PEST-containlng nuctear protein 1.3 3.8 

131037 BE243101 Hs.22391 chromosome 20open reading frame 3 1.9 4.1 

131039 D87436 Hs.166318 Gpin 2 1.6 3.5 

131060 AA194422 Hs.22564 myosin VI 4.5 5 

20 131097 AL137682 Hs.22937 l-kappa-B-interacting Ras-iike protein 2 2 37 

131101 BE387561 Hs.22981 DKFZP586M1523 protein 1.6 • 4.5 

131104 W27770 Hs.301756 ESTs ( WeaWy similar to T31475 hypotheti 0.9 3.5 

131107 BE620886 Hs.75354 GCN1 (genera) control of amino-atfd synt 2.1 4.5 

131109 BE564123 Hs.23060 DKFZP564F0522 protein 1.1 4.6 

25 131136 AB033099 Hs.23413 KIAA1273 protein U 42 

131148 AW953575 Hs303125 conduced protein PIGPC1 4.5 13.5 

131150 X77753 Hs.23582 tumor-associated calcium signal transduc 3.4 0.4 

131156 AI472209 Hs.323117 ESTs 0.8 4.9 

131164 AW013807 Hs.182265 keratin 19 3.3 2.4 

30 131181 H25094 Hs.293663 ESTs, Moderately similar to 138022 hypot 0.6 4 

131194 AW864222 Hs.24083 KIAA0997 protein 1.4 3.8 

131199 AW979155 Hs.298275 amino acid transporter 2 1.2 8,5 

131215 AL050107 Hs.24341 transcriptional co-acUvator with PDZ-bl 0.7 47 

131216 AI815486 Hs.243901 Homo sapiens cDNA RJ20738 fis, clone HE 2.1 8.2 
35 131233 D89053 Hs.268012 fatty-acld-Coenzyme A Bgase, long-chain 1.7 3.5 

131237 AW956868 Hs.24608 DKFZP564D177 protein 1.3 5.4 

131262 AU077158 Hs.24930 tubufirMpecific chaperone a 1.6 4.8 

131263 AU077002 Hs.24950 regulator of G-protein signalling 5 1.4 4.4 
131367 AI750575 Hs.173933 nuclear factor VA 3.3 . 22 

40 131372 AW293399 Hs.144904 nudear'feceptorco-repressorl 1.6 3.9 

131373 NM.006052 Hs.26146 Down syndrome critical region gene 3 1 11.1 

131388 NMJM810 Hs.92200 KIAA048O gene product 5 2 

131492 AI452601 Hs.288869 nuclear receptor sutrfamiry 2, group F, m 0.9 3.5 

131493 AW960146 Hs.284137 hypothetical protein FU12888 1 3.5 
45 131514 BE270734 Hs.2795 lactate dehydrogenase A 2 6.5 

131524 AB040927 Hs.301804 WAA1494 protein 1.5 107 

131528 AU0764Q8 Hs.28309 UDP-ghicose dehydrogenase 1.3 4.7 

131534 AF157326 Hs.184786 TBP-lnlerading protein 1.3 4.9 

131555 T47364 Hs.278613 Interferon, alpha-inducible protein 27 1.5 8 

50 131578 AA936296 Hs.234265 DKFZP586G011 protein 1.8 3.5 

131589 C18825 Hs.29191 epithelial membrane protein 2 1.3 8.2 

131609 D83032 Hs.169984 nuclear protein 2.8 3.9 

131626 BE514605 Hs.289092 Homo sapiens cDNA: FU22380 fe, clone H 1.3 11.2 

131670 H03514 Hs.10130 ESTs 1.3 4.8 

55 131697 C19034 Hs.288613 Homo sapiens cDNA RJ14175 fis, done NT 3.2 97 

131701 AF103798 Hs.30819 hypothetical protein 1.3 5.2 

131703 AW160865 Hs.30888 cytochrome c oxkiaso subunit Vila pofype 1.3 7.8 

131739 AF017986 Hs.31386 secreted frlzzled-related protein 2 10.6 14.7 

131764 AI805664 Hs.31731 peroxkedoxln 5 1.1 3.6 

60 131781 AF077036 Hs.31989 OKFZP586G1722 protein 1.6 37 

131791 X62111 gb:H.sapiensVII«5 gene for irraramogtobuJ 1.1 3.5 

131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1 HUMAN IROQU 5.3 1.2 

131870 NMJJ14874 Hs.3363 WAAG214 gene product 0.6 4.2 

131903 NM.004642 Hs.3436 deleted In oral cancer (mouse, homolog) Z4 4.9 

65 131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso 24 6 

131930 AA772603 Hs.69476 Homo sapiens cDNA FU12758 fis, done NT 17 9.2 

131941 BE252983 Hs.35086 ubiquitin specific protease 1 0.5 5.2 
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131947 AI123939 Hs.182997 ESTs 0.7 4.1 

131961 AA129782 Hs.3576 Homo sapiens mRNA full length insert cON 0.9 4.8 

131964 AW381148 Hs.196365 2,3-bisphosphoglycerate mutase 11 6.1 

131974 AF208856 Hs.268122 hypothetical protein 1.3 3.9 

5 131983 AF1 19665 Hs.184011 pyrophosphatase (Inorganic) 3.3 6.9 

131997 AF229181 Hs.136544 CS box-containing WD protein 0.9 5.2 

132006 AW162336 Hs.3709 tow molecular mass ubiquinone-binding pr U 3.6 

132063 BE277910 Hs.3833 3*i>hosph«denc^ff-phosphosulratesy 3.2 1.8 

132065 BE379335 Hs.211594 proteasome (prasome, macropaln) 26S subu 1 1 3.6 

10 132071 AF217798 Hs.3850 US1 -interacting protein NUDEL; endoolig 0.7 5.2 

132079 AI701457 Hs.38694 ESTs 2 5.3 

132094 NMJM6045 Hs.3945 CGI-107 protein M 4.3 

132116 AW960474 Hs.40289 ESTs 3.1 3.1 

132164 AI752235 Hs.41270 procoUagen-iysine,2-Qxoglutarate5-<iio 1.8 3.7 

15 132181 AW961231 Hs.16773 Homo sapiens done TCCCIA00427 mRNA sequ 1.2 5 

132208 AL031709 Hs.241575 hypothetical protein CAB561 84 1.4 4.2 

132258 AA306325 Hs.4311 SUMO-1 activating enzyme subunit 2 2 10.3 

132303 BE177330 Hs.325093 Homo sapiens cDNA: FU21210 fis, clone C 1.2 4.1 

132316 U28831 Hs.44566 KIAA1641 protein 5.9 1.6 

20 132358 NrvL003542 Hs.46423 H4 histone family, member G 5.8 1.5 

132384 AA312135 Hs.46967 HSPC034 protein 2.1 9.3 

132397 AA021160 Hs.4750 hypothetical protein DKFZp564K0822 1.3 4.6 

132413 AW361383 Hs.260116 metailoprotease 1 (pitrilysln family) 2 4.9 

132442 AW970859 Hs.313503 ESTs 1.2 5 

25 132534 BE388673 Hs.5086 hypoftetical protein MGC10433 2 3.9 

132540 BE396290 Hs.5097 synaptogyrin 2 1.4 5.1 

132554 AF065391 Hs.194718 zinc finger protein 265 1.2 4 

132575 AV660538 Hs.2841 62 60S ribosomal protein L30iso!og 3 1.7 

132585 AF029750 Hs. 179600 TAP binding protein (tapasin) 1.8 4.7 

30 132602 AW606927 Hs.5306 hypothetical protein DKFZp586F1 122 slmil 1.6 4.9 

132608 AA353044 Hs.5321 ARP3 (actlrwelated protein 3, yeast) ho 1.8 8.1 

132718 NM.004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 4.2 2 

132719 AI264357 Hs.55405 hypothetical protein MGC16212 1.1 5.3 
132730 AK000868 Hs.5570 hypofoetical protein FU 10006 1.4 5.2 

35 132765 BE222975 Hs.56205 Insulin induced gene 1 1.1 5.8 

132782 F07424 Hs.279840 zinc finger protein 222 1.3 3.7 

132793 AB020713 Hs.56966 WAA0906 protein 2.3 6.3 

132805 AW975748 Hs.5724 sclerostin 0.7 7.7 

132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 1.8 6.2 

40 132894 D63209 Hs.5944 solute carrier family 1 1 (proton-coupled 1.5 20.8 

132930 AA579258 Hs.6083 Homo sapiens cONA:FU21028fe, done C 1 3.8 

132932 AW1 18826 Hs.6093 Homo sapiens cDNA: FU22783 fis, clone K 0.7 5.4 

132933 BE263252 Hs.6101 hypothetical protein MGC3178 1.6 4.1 
132965 AI248173 Hs.191460 hypothetical protein MGC 12936 1 4.2 

45 132984 BE539199 Hs.62112 zinc finger protein 207 1.5 4.4 

132990 X77343 Hs.334334 transcription factor AP-2 alpha (activat 13.9 0.8 

132998 YO0062 Hs.170121 protein tyrosine phosphatase, receptor t 0.6 4.6 

133002 AW499985 Hs.42915 ARP2 (actin-related protein 2. yeast) ho 1.5 11.1 

133011 NM.006379 Hs.171921 sema domain, Immunoglobulin domain (Ig), 3.5 1 

50 133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 1 4.5 

133040 AW502761 Hs.30909 WAA0430 gene product 0.9 5.5 

133056 H12028 Hs.6396 Jumping translocation breakpoint 1.7 5.3 

133063 A1654133 Hs.30212 thyroid receptor interacting protein 15 0.6 4.9 

133067 AK000708 Hs.169764 hypothetical protein FU20701 1.2 3.5 

55 133080 AF089816 Hs.6454 chromosome 19 open reading frame 3 1.2 17.5 

133110 AA808177 Hs.65228 ESTs 0.9 5.1 

133150 AV655783 Hs.661 Empirically selected from AFFX single pr 11 4.5 

133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-r 1.5 4.8 

133199 AF231981 Hs.250175 homotog of yeast long chain poryunsatura 5.5 5.9 

60 133203 AA464362 Hs.6748 hypothetical protein PP1665 U 3.7 

133206 AB037773 Hs.6762 hypothetical protein 1.6 8.6 

133221 W32474 Hs.301746 RAP2A, member of RAS oncogene family 2.4 4.8 

133229 AL137480 Hs.6834 KIAA1014 protein * 1 4.2 

133241 AW796524 Hs.68644 Homo sapiens microsomal signal peptidase 1.3 3.9 

65 133257 BE617892 Hs.6895 actin related protein 2/3 complex, subun 1.4 5.4 

133271 Z48633 Hs.283742 ftsaplens mRNA forreirotransposon 3.1 0.7 

133273 N27672 Hs.69469 dendritic ceB protein 2.5 6.5 
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133287 AW797437 


Hs.69771 B-fector, properdin 


1.3 


4 


133291 BE297855 


Hs.69855 NRAS^elatedgene 


1.4 


5 


133292 AA304961 


H&699 peptidylprolyl feomerase 8 (cydophflin 


2.2 


6.8 


133294 AJ001385 


Hs.69997 zinc finger protein 238 


1.5 


4.3 


133300 AF116666 


Hs.70333 hypothetical protein MGC10753 


1.4 


6.3 


133302 X04898 


Hs.237658 apoGpoprotein A-tt 


0.2 


3.6 


133308 U56979 


Hs.250651 H factor 1 (complement) 


0.6 


5 


133347 BE257758 


Hs.71475 add cluster protein 33 


\2 


4.2 


133370 AF245505 


Hs.72157 DKFZP564I1922 protein 


3.7 


5.8 


133404 AB007916 


Hs.214646 WAA0447 gene product 


U 


5.1 


133408 AI738719 


Hs. 193427 hexoWnase2 


0.9 


6.3 


133422 AB033081 


Hs.73287 WAA1235 protein 


U 


3.7 


133442 AL137663 


Hs.7378 Homo sapiens mRNA; cDNA DKFZp434G227 (fr 


0,7 


133448 M27749 


Hs.288168 Immanogtobulin lambda-like polypeptide 1 


1.1 


4.3 


133449 AF038962 


Hs,7381 voltageKiependent anion channel 3 


0.7 


4.2 


133501 AI962602 


HsJ4284 hypothetical protein MGC2714 


3.1 


5.9 


133504 NM.004415 


HsJ4316 desmoplaWn(DPI 1 DPII) 


4.3 


115 


133506 BE562956 


H&74346 hypothetical prate* MGC14353 


1.8 


19.7 


133532 D87452 


Hs.74579 KIAA0263 gene product 


1.2 


5.4 


133574 H97991 


Hs.193313 Empirically selected from AFFX single pr 


1.4 


3.9 


133586 AI929645 


Hs.225936 synapsinl 


0.8 


4.9 


133589 L37368 


Hs.75104 RNA-binding protein S1, serine-rich doma 


2 


10.8 


133591 AI423369 


Hs.75111 protease, serine, 11 (IGF binding) 


2.1 


4.5 


133606 U10564 


Hs.75188 wee1+(S.ponfoe)hornolog 


3.3 


1.1 


133617 BE244334 


Hs.75249 ADP-ribosylafion factor-like 6 interact] 


2.3 


5.6 


133651 AI301740 


Hs.173381 dihydropyrimldinase-fike2 


0.8 


13.5 


133660 H14843 


Hs.303154 popeye protein 3 


1 


9.1 


133663 AJ006239 


Hs. 75438 qulnoid dfoydropteridine reductase 


0.5 


5.8 


133668 L77964 


Hs.271980 rnitogen-aotivaled protein kinase 6 


1.1 


6.9 


133671 AW503116 


Hs.301819 zinc finger protein 146 


1.8 


3.8 


133681 AI352558 


Ks.75544 tyrosine 3-monooxygenase/uyptophan 5-mo 


1.5 


11.1 


133694 W17187,comp Hs.232400 heterogeneous nuclear ribonucleoprotein 


2 


3.9 


133708 AI018666 . 


Hs.75667 synaptophysin 


0.6 


3.5 


133737 AW001130 


Hs.75824 KIAA0174 gene product 


1.2 


7.2 


133743 AI929587 


Hs.75847 CREBBP/EP300 inhibitory protein 1 


1.5 


5 


133750- BE410769 


Hs.75873 zyxin 


\2 


4.8 


133765 M62194 


Hs.75929 cadherin 1 1 , type % OB-cadherin (osteob 


3.2 


4.1 


133776 BE268649 


Hs. 177766 ADP-ribosyitransferase (NAD+; poly (ADP- 


2.1 


3.8 


133799 W24087 


Hs.76285 DKFZP564B167 protein 


1.9 


12.6 


133800 AF075337 


Hs.76293 thymosin, beta 10 


2.6 


6.6 


133802 AW239400 


Hs.76297 G protein-coupled receptor kinase 6 


1 


4,9 


133806 D25969 


Hs.76325 step 11 splicing factor SLU7 


0.5 


3.8 


133817 AW578716 


Hs.7644 H1 hlstone family, member 2 


1.5 


4.5 


133829 AW630088 


Hs.76550 Homo sapiens mRNA; cONAOKFZp564B1 264 (f 


3.7 


133841 AA345824 


Hs.76688 carboxylesterase 1 (monocyte/macrophage 


0.3 


4.4 


133845 AA147026 


Hs.76704 ESTs 


5.5 


2.9 


133863 AI815523 


Hs.76930 synuctein, alpha (non A4 component of am 


0.6 


4.8 


133887 X07767 


Hs.77271 protein kinase, cANP-dependent, cataiyti 


1 


10.2 


133892 AW859528 


Hs .301 497 aminotransferase 1 


0.9 


4.8 


133913 AU076964 


Hs.7753 calumenln 


Z8 


10.5 


133914 A1458213 


Hs.77542 ESTs 


1.8 


9.0 


133917 AL031177 


Hs.7756 proteasome (prosome, macropain) 26S subu 15 


6.6 


133947 L41066 


Hs.77810 nuclear factor of activated T-ceOs, cyt 


1.5 


3.8 


133986 M54968 


Hs.184050 v-Kkas2 Kirsten rat sarcoma 2 viral on 


0.9 


4.3 


133987 L15409 


Hs. 174007 von HippeRJndau syndrome 


2.3 


4.3 


133989 AL040328 


Hs.78202 SWl/SNF related, matrix associated, acti 


3.3 


3.4 


133990 R48316 


Hs.7822 Homo sapiens mRNA; cDMA DKFZp564C1216 (f 


1.3 


134029 BE150882 


Hs.143601 hypothetical protein hCUUso 


1 


6.5 


134040 NM.003470 


Hs.78683 ubiquitin specific protease 7 (herpes vi 


1.7 


3.6 


134042 AI027881 


Hs,7869 lysosomal 


1 


7.5 


134049 AF1 17236 


Hs.78825 matrin3 


1.2 


4 


134095 NM.004354 


Hs.79069 cydinG2 


2.7 


4.8 


134098 BE513171 


Hs.79086 mitochondrial ribosomal protein L3 


3.3 


2.1 


134207 Z43039 


Hs.170198 WAAQ009 gene product 


U 


3.5 


134210 AF035606 


Hs.80019 programmed ceO death 6 


1.7 


6.9 


134218 U77735 


Hs.80205 plm-2 oncogene 


0.8 


5.3 


134270 X68194 


Hs.80919 synaptophysin-Qke protein 


1.4 


11.4 



4.8 



5.6 



5.7 
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134277 NMJJ04369 Hs.80988 collagen, type VI, alpha 3 2.6 35 

134280 NMJJ00712 Hs.81029 bfliverdln reductase A 18 5.8 

134288 AI022650 Hs.8117 erbb2-lnteracfing protein ERBIN 1.1 3.6 

134296 R00603 Hs.8128 phosphatidytserine decarboxylase 1.1 5.9 

5 134300 NMJJ01430 Hs.8136 endothelial PAS domain protein 1 0.5 4.8 

134310 AL037800 Hs.8148 selenoprotein T 1.7 7.9 

134343 D50683 Hs.82028 transforming growth factor, beta recepto 0.8 7.6 

134364 X76534 Hs.82226 glycoprotein {transmembrane) nmb 2.2 3.6 

134374 N22687 Hs.8236 ESTs 1.9 3.6 

10 134378 AL035786 Hs.82425 actin related protein 2/3 complex, subun 1.5 8.3 

134382 BE512856 Hs.109051 SH3 domain binding glutamic acid-rich pr 1.1 3.6 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, m 1.9 4.6 

134417 NMJJ06416 Hs.82921 solute carrier family 35 (CMP-sialicaci 1.2 7.5 

134421 AU077196 Hs.82985 collagen, type V, afeha 2 6.6 8.7 

15 134439 Z23024 Hs.138860 Rho GTPase acfivating protein 1 2 3.9 

134454 NM_013230 Hs.286124 CD24 antigen (smail cefl rung carcinoma 3.5 1.1 

134494 D86981 Hs.84084 amyloid beta precursor protein (cytoplas 1.5 4.4 

134501 W84869 Hs.211568 eukaryotlc translation initiation factor 1.2 5.7 

134505 AW960673 Hs. 177530 ATP synthase, H+ transporting, mitochond 1.3 3.9 

20 134520 BE091005 Hs.74861 activated RNA polymerase II transcriptio 1.8 4.3 

134528 M23161 Hs.84775 Human transposon-Oke element mRNA 0.8 5.6 

134545 AI902899 Hs.85155 butyrate response factor 1 (EGF-response 1.4 5 

134553 AI203545 Hs.296169 S-phase response (cycfiiHelated) 0.8 3.9 

134573 NM_016142 Hs.279617 steroid dehydrogenase homotog 1.3 5.7 

25 134576 AB033017 Hs,8594 KIAA1 191 protein 0.9 3.7 

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 4 6,8 

134579 AW936928 Hs.85963 DKFZP564M182 protein 2.2 4.3 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 1.6 3.6 

134600 AF078859 Hs.86347 hypoflieflcal protein 2.1 3.5 

30 134655 AF265208 Hs. 123090 SW1/SNF related, matrix associated, act! 1.7 4.2 

134700 AKD00606 Hs.8868 golgi SNAP receptor complex member 1 4.4 0.9 

134737 017530 Hs.89434 drebrin 1 3.1 1.6 

134762 T51986 Hs.283108 hemoglobin, gamma G 0.5 4.6 

134843 AA428520 Hs.90061 progesterone binding protein 1.3 3.7 

35 134854 J03464 Hs.179573 coflagen, type I, alpha 2 8.7 17.3 

134865 AA587775 Hs.66295 multi-PDZ-dornalrKontaining protein 1.7 4 

134868 AB020689 Hs.90419 KIAA0882 protein 3.4 0.9 

134874 AI803761 Hs.90458 serine palmitoyltransferase, long chain 1.3 6.9 

134885 AJ002030 Hs.9071 progesterone membrane binding protein 1.4 9.6 

40 134891 R51083 Hs.90787 ESTs 1 10.1 

134908 BE089782 Hs.9877 hypothetical protein 1.9 3.9 

134934 AF005043 Hs.91390 poly (ADP-ribose) glycohydrolase 1 4.3 

134970 BE560779 Hs.284233 NICE-5 protein 1.4 10.4 

134982 AK002085 Hs.92308 Homo sapiens cDNA FU 11223 fis, clone PL 1.6 4.1 

45 135011 AB037835 Hs.92991 KIAA1414 protein 1.2 5.6 

135032 AW301984 Hs.173685 hypothetical protein FU12619 1.7 7.6 

135035 AL034344 Hs.284186 forkhead box C1 3.2 0.6 

135051 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.2 '4.1 

135060 AK001887 Hs.259842 protein kinase, AMP-adivated, gamma 2 n 1.3 4.8 

50 135062 AK000967 Hs.93872 KIAA1682 protein 2 3.7 

135077 AW503733 Hs.9414 WAA1488 protein 2.8 3.7 

135082 AB017363 Hs.94234 WzzledtDrosophilaJhomolog 1 2.4 4.8 

135107 T97257 Hs.337531 ESTs, Moderately similar to 138022 hypot 1.4 5.8 

135143 AA132813 Hs.69559 KIAA1 096 protein 1.8 8.5 

55 i 135156 BE563088 Hs.9552 binder of Art Two \2 6.8 

135172 AB028956 Hs.12144 WAA1Q33 protein 3.1 1.4 

135181 BE250865 Hs.279529 px19-iike protein 1.3 7.5 

135222 AA534009 Hs. 183487 interferon stimulated gene (20kD) 1.3 3.8 

135232 AL038812 Hs.96800 ESTs, Moderately similar to ALU7_HUMAN A 2.1 3.9 

60 135289 AW372569 Hs.9788 hypothetical protein MGC10924 similar to 0.9 8.4 

135290 AA331901 Hs.184736 hypofiietical protein FU10097 1 3.8 

135291 T83882 Hs.97927 ESTs 1.2 3.5 
135349 AA114212 Hs.9930 serine (or cysteine) proteinase inhibfto 2.6 8.9 
135357 AI565004 Hs.79572 cathepsln D (lysosomal aspartyl protease 15 5.4 

65 135396 M16029 Hs.287270 ret proto-oncogene (multiple endocrine n 0.4 7.9 

135399 W79431 Hs.326249 ribosomal protein L22 1.5 4.5 

135400 X78592 Hs.99915 androgen receptor (dBiydroteslosterone r 3.2 1.8 
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302963 
303131 
303150 
310125 
312662 
319429 
320591 
406779 
410691 
410763 
415738 
420186 
422055 
425815 
426218 
427397 
427466 
427505 
427723 
428673 
430219 
430450 
25 432866 
433423 
437562 
437667 
437754 
440252 
441471 
448292 
449404 
449964 
451389 
452685 
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AW176909 
AW673106 
AW081061 
AA887146 
AA147979 
AA233808 
BE616412 
AA054761 
M412048 
AW239226 
AF279145 
BE539367 
NMJ15925 
NM.014320 
R94023 
AF1 19043 
A1929685 
AA523543 
AA361562 
AI355260 
AW601325 
X99209 
R23553 
BE395875 
BE407127 
AB001636 
BE616412 
R60366 
8E513940 
AL042986 
BE281316 
H51066 
AW001741 
N73222 
AI634651 
RC_H15847_s 
RCJV84712 
X14008_ma1_f 
RCJi86543J 
H07011 

RCJW164586.S 
RCJWJ70485 
RCJi98714js 
RCJW406145J 
AA458584 
AA031548 
X02761 

RCJ\A487193 

R25326 

RCJVA393805 

RC AA449333 

RCIM287681_s 

RC_AA490864 

RC.C14243J 

R21443 

RCJVA251902 

M21121_s 

C00038_s 

Y00503 

RCJ*27006J 

RCJ\A416886 

RC_AA460450 

RC_AA488433 

RCJ\A278400J 

U28831 

RC.AA199588 



Hs.224410 Homo sapiens cDNA FU 12843 Us, done NT 3.6 

Hs.42346 cahineurfai-binding protein caisartin-1 3.3 

Hs.151945 mitochondria) ribosomal protein L43 0.9 

Hs.103180 DC2 protein 3 

Hs.8217 stromal antigen 2 62 

Hs,285005 iTiltochondna! Import receptor Tom22 1.2 

Hs.286241 protein kinase, cAMP-dependent, reguiato 1 

Hs.286218 junctional adhesion molecule 1 1 .5 

Hs.169149 karyopherin alpha 1 (irnportin alpha 5) 12 

Hs279574 CGK39 protein; cell death-regulatory pr 1.3 

Hs.65450 retfculon4 M 

Hs.8966 hypothetical protein RJ21776 2 

Hs.295953 ESTs. Weakly similar to AF220049 1 uncha 1.3 

Hs.95697 Bver^pedficbHl^^trawcrlpttonfa 1.5 

Hs. 11 1029 putative heme-hlnding protein 2 

Hs.337531 ESTs i Moo^ralBlysirnaartol38022hypot 1.7 

Hs.168005 Homo sapiens cONA RJ13372 fis, clone PL 33 

Hs.177656 calmodulin 1 (phosphorylase kinase, delt 1.3 

Hs.7678 cellular reMcacfrbinaTng protein 1 1.1 

Hs. 178761 26S proteasome-assodated padl homolog 3.2 

Hs.279789 hlstone deacetytase 3 2.8 
Hs.324278 Homo sapiens mRNA; cONA DKFZp566M063 (fr 

Hs.235887 HMT1 (hnRNP methyltransferase, S. cerevi 1 .8 
Hs.241469 hypothetical protein 
Hs.279609 mitochondrial carrier homolog 2 
Hs.8997 heat shock TOkO protein 1 A 
Hs.5683 DEAD/H (Asp-Glu^Asp/His) box polypep 
Hs.286218 junctional adhesion molecule 1 
Hs.5822 Homo sapiens cONA FU22120 hs, done H 
Hs.6101 hypothetical protein MGC3178 
Hs.7857 erythrocyte membrane protein band 
Hs.47334 hypothetical protein FU14495 
Hs.23581 leptin receptor gene-related protein 
Hs.273193 hypothetical protein FU10706 
Hs.279009 matrix Gla protein 
Hs.30250 v-maf musculoaponeurotfc fibrosarcoma (a 
pepSdylprotyl feomerase B (cydophQIn 6} 
calumenln 

lysozyme (renal amyloidosis) 
ESTs 

ESTs; Weakly similar to SAS {H sapiens] 



1.1 
1.5 
1.3 
1.6 
U 
2 

1.1 
0.5 
2.5 
1.1 
1.4 
4 

0.8 
1.8 
3.5 
0.9 
1.8 
1.8 
ESTs 
3.4 
1.6 
ESTs 
3.4 



Homo sapiens clone 23967 
ESTs 

SRY (sex determining region Y)-box 4 
cell division cycle 42 (GTP-blnoTng protein; 25kD) 

fibronecttn 1 3.6 

secreted mzzled-reiated protein 4 4.7 

Homo sapiens mRNA for putative vacuolar 0.9 

ESTs; Weakly similar to (deffine not 1.1 

ESTs 2.9 

ESTS 

ESTs; Highly similar to heat shock factor 1 .4 

ESTs; Highly similar to heat shock factor 1 .7 

ESTs 1.6 

Homo sapiens tysophospho lipase (LPL1) 2.2 

small inducfcle cytokine A5 (RANTES) 0.9 

ESTs 18 

kerafin 19 3.1 

ESTs 1.6 

ESTs; Weakly similar to predicted using 3.1 

fibroblast growth factor receptor 2 (bacteria- 1 .5 

ESTs; Weakly similar to deduced amino acid 1 .1 



3.6 

I. 6 
42 
17.3 
4 

6.6 
3.5 
4.7 
,6.6 
3.5 
13.9 
5.1 
3.9 
6.2 

II. 3 
3.6 
2.8 
4.7 
3.7 
2.5 
22 
1.1 
8.8 
5.6 
6.1 
7.6 
6.5 
3.5 
5.7 
6.2 
3.7 
4.9 
3.6 
3.5 
112 
5.6 
4.8 
4.6 
4.5 

'6.6 
3.9 
6.2 
2.6 
3.5 
4.6 
0.4 
3.1 
152 
4 
5 

8.4 

4.6 

1.3 

5 

5 

3.7 
3.8 
9.9 
4.8 
1.1 
3.7 
3.1 
3.7 
4 



5.2 



0.8 



3.9 



Homo sapiens HRIHFB21 15 mRNA; partial cds 
Human protein immune-reactive with anti-PTH4.4 0.6 
Homo sapiens actirwelated protein Arp3 (ARP3) 1 .8 4.7 
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AF006082 


Homo sapiens actin-related protein Arp2 (ARP2) 


1.6 


10.9 


RC_H90899 


desmoptekin (DPI; DPI!) 5.4 


5.5 




RCJV95070 


desrnopfaWn (DPI; DPI!) 5 


2.6 




RCJ90946J 


Human mRMA for KIAA263 gene; complete ods 


1.1 


3,9 


D87258 


protease; serine; 1 1 (IGF binding) 2.4 


3.5 




AA313414_S 


ESTs; Weakly similar to cONA EST EMBLT1 157 


1.5 


5.3 


RC - H73484_s 


ESTs; Weakly similar to similar to Yeast 13 


6.3 




AFFX-HUMISGF3A/M97935 3 


2.3 


13.5 


AFFX-HUMRGE/M10098.5 1.1 


7.9 




AFFX-M2783Q.5 




0.5 


7.4 


AFFX-M27830.5 




0.6 


' 5.4 


RC_AA063431J 


ESTs 


0.8 


4.1 


RCJ63769J 


ferritin; light polypeptide 1.1 


3.7 
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TABLE 8A 

Table 8 A shows the accession numbers for those pkeys lacking unigenelD's for Table 8. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



108469 116761J AA079487 AA128547AA1 28291 AA079587 AAQ79600 
20 125076 190299J AA973971 T88817 AA253263 

114638 109698J AA075488 AA129081 AA074851 AA082852 AA074732 AA084908 AA084751 AA076042 AA131172 AA085374 AA079519 
AA074510 AA1 13824 AA102437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 AA062834 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA0831 15 M070942 AA085296 
123526 genbank u AA608657 AA608657 
25 123533 genbankJVA608751 M608751 
125090 genbankJ91518 T91518 
125154 genbanLW38419 W38419 
118475 genbanLN66845 N66845 
118505 genbanK.N67343 N67343 
30 101046 entrez_K01160KQ1160 

129982 221 J67 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X6101 1 M34024 AA327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 214205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z8084O X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X6591 9 AF062190 
35 AF062177 AF062222 AF062115Z47240AF062263 AF062261 AF062223AF062211 Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L11699 AW402665 109070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 112087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
40 X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 

D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF115109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
45 - Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 

AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 ARJ62307 AW408326 133037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AFi03i63 Z98717 AF1Q3143 AjQ08250 UJ4323 AF10332i 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
50 AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF06867O AF068671 AF068664 

AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW3621 53 AW403B03 AW406702 
AW351514 D78345 T29140 J00231 NNL002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 M71 5266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW4047B9 AA487630 AA715498 AA295885 T27613 T98113 
55 108470 genbankJ\A079500 AA079500 
101447 entre^_M21305 M21305 
124447 genbanleN48000 N48000 
101624 entreOte5998 M55998 

131791 221 260 X62111 S67984 AJ131056 Z47243Z47235 AF062268Z47237 AJ131058 AF062130AF062282 X62108AA385989AA464794 
60 X69861 AW402964 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U96288 

AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 Z98714 Z98738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 A1538691 AJ012264 AA664201 AI880450 AA327310 AI991250 
AI833028 AW001210 A1956075 H30467 AA326915 H41943 AI749266 AI744441 AA327377 AW512326 AI735170 H01634 
5 AI587047 AI571623 AA327486 AA327103 AA327195 AA326973 T28143 

124842 217726J R56485 R37248 R59992 

103758 AA064874_f_at AA084874J 

130064 221J264 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 
X578 19 X57823 AW405604 AW404447 Z34914 AW406542 AA427726 M604389 AW4Q5606 AW405918 AW4051 17 
1 0 AW407182 L03632 AW405058 L03627 AW407470 R72738 121959 AW375738 X87888 AF124166 AF054638 AJ241406 

U09902 AW405284 M12025 AA360219 L03631 L03629 AW327727 AF194810 AF058076 AF194686 AF063774 AF063737 
AF063755 AF063740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF19481 1 
AF194809 AF194814 AF194808 AF064505 AF064503 AF063765 AF063757 AF063752 AF063716 AF063736 AF194806 
AF058077 AF083747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
1 5 U38589 Z18332 AF060122 AF194807 AF060135 AF064506 AF064504 AF063773 L26540 AF058072 AF06O131 AF064500 

AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916 AJ241405 AF194632 AJ388659 AF194625 Z74694 AF194588 AF194601 U09901 U0991 1 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 D84141 AW405758 AF001788 AF194580 084143 U76684 AF194593 L03630 
20 X87892 X91134 U21249 Z46346 AJ132426 AF103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599 

AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF103710 AF103706 S62232 AF103645 AF103632 AF103647 AF103644 AF103640 
AF103692 AF093581 AF103620 AW405934 AI445389 AW383753 AA360256 AF099676 H21654 H39501 AI820828 H53669 
25 W26785 AW384496 AW407708 AA541663 AA91 1602 AI821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193 

AA864302 H25133 D87023 J03011 M61771 D87017 AA526865 AA253450 

130232 18831_2 U29463 U06715 W24970 M584303 T39581 AA155603 AA305043 AA429426 W05664 M102382 AA482044 W24487 AA319060 

T88946 F10106 AA232161 AA2431 17 AA158937 AA100864 

109097 genbankJ\A167512 AA167512 
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TABLE 9: Figure 9 from BRCA 001-2 US 

5 Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD; Unigene number 

Unigene Title: Unigene gene title 



IS Pkey ExAccn UnigenelD UnlgeneTitle 



20 



25 



30 



35 



100690 
102211 
103587 
104115 
105038 
105500 
105990 
106155 
106373 
106414 
110009 
111900 
114540 
116470 
117280 
119771 
121723 
124059 
131148 
132371 
134169 
302235 
452410 



AA383256 

BE314524 

BE270266 

AF183810 

AW503733 

AW602166 

A1690586 

M425414 

AW503807 



BE075297 

AF131764 

A1904232 

AI272141 

M18217 

A1905687 

AA243499 

BE387335 

AW953575 

AA235448 

AI690916 

AL049987 

AL133619 



Hs.1657 

Hs.78776 

Hs.82128 

Hs.26102 

Hs.9414 

Hs.222399 

Hs.29403 

Hs.33287 

Hs.21907 

Hs.28827 

Hs.6614 

Hs.25318 

Hs.75323 

Hs.83484 

Hs.172129 

Hs.2533 

Hs.104800 

Hs.283713 

Hs.303125 

Hs.46677 

Hs.178137 

Hs.166361 

Hs.29383 



estrogen receptor 1 

putative transmembrane protein 

5T4 oncofetal trophoblast glycoprotein 

opposite strand to trichorWnophalangeal syndrome I 

KIAA1488 protein 

CEGP1 protein 

hypothetical protein FU22050 

nuclear factor VB 



rrdtogen-actlvated protein kinase kinase kinase 2 

ESTs, Weakly similar to A43932 mucin 2 precursor, Intestinal 

Homo sapiens clone 25194 mRNA sequence 

prohibltin 

SRY (sex determining region Y)-box 4 

Homo sapiens cONA: FU21409 fis, done COL03924 

EST 

hypothetical protein RJ10134 

ESTs, Weakly similar to S64054 hypothetical protein YGLOSOw 

p53-induced protein PIGPC1 

PRO2000 protein 

transducer of ERBB2, 1 

Homo sapiens mRNA; cONA DKFZp564F1 12 

Homo sapiens mRNA; cONA DKFZp434 
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TABLE 10: Figure 10 from BRCA 001-3 PCT 

Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unlgene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90" percentile tumor to body 

R3: Ratio of 75* percenfile body to tumor 

R4: Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD 


Unigene Title 


R1 


R2 


R3 


R4 


100082 AA130080 Hs.4295 


proteasome (prosome, macro pain) 26S subu 


4.2 


152 


36 


12.2 


100103 M380887 Hs.5085 


doUchyf-phosphate mannosyltransferase p 


9.8 


123 


13 


5 


100131 012485 Hs.11951 


ectonudeotide pyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 D13666 Hs. 136348 


osteoblast specific factor 2 (fasdclin 


15.7 


1030 


66 


5 


100154 H60720 Hs.81892 


KIAA0101 gene product 


4.1 


320 


78 


10.6 


100157 014661 Hs.119 


Wilms 1 tumour 1-assoclating protein 


4.7 


119 


26 


3 


100169 AL037228 Hs.82043 


D123 gene product 


5.1 


106 


21 


9.2 


100203 .BE242284 Hs.172199 


adenylate cyclase 7 


4.7 


47 


1 


4.3 


100210 D26361 Hs.3104 


KIAA0042 gene product 


4.7 


47 


4 


0.7 


100219 AW972300 Hs.118110 


bone marrow stromal cell antigen 2 


3.8 


350 


93 


1.9 


100234 D29677 Hs.3085 


KIAA0054 gene product; Helicase 


4.1 


64 


16 


3 


100248 NM.015156HS.78398 


K1AA0071 protein 


3.4 


77 


23 


5.9 


100252 NWL006207HS.170040 


platelet-derived growth factor receptor- 


4.5 


45 


4 


4 


100260 D38491 Hs.322478 


KIAA0117 protein 


55 


59 


1 


2.6 


100279 042084 Hs.82007 


KIAA0094 protein 


3.5 


96 


28 


1.3 


100286 BE247550 Hs.86859 


growth factor receptor-bound protein 7 


3.1 


306 


98 


1.5 


100294 AA331881 Hs.75454 


peroxiredoxin 3 


12.8 


128 


1 


11.7 


100335 AW247529 Hs.6793 


platelet-activating factor acetyihydrola 


4.2 


187 


44 


5.4 


100365 AI878927 Hs.79284 


mesoderm specific transcript (mouse) horn 


4.5 


129 


29 


3.1 


100375 080004 Hs.75909 


K1AA0182 protein 


3.5 


78 


23 


4.8 


100409 D86957 Hs.80712 


KIAA0202 protein 


10.2 


102 


1 


4.8 


100410 086961 Hs.79299 


lipoma HMGIC fusion partner-like 2 


4 


40 


1 


3.8 


100414 NM_014735Hs.82292 


KIAA0215gene product 


3.2 


32 


2 


2.9 


100418 D86978 Hs.84790 


KIAA0225 protein. 


3.6 


36 


7 


3.2 


100438 AA013051 Hs.91417 


topoisomerase (DNA) II binding protein 


5.6 


76 


14 


2 


100439 AA347720 Hs.122669 


K1AA0264 protein 


3.5 


35 


9 


3.1 


100448 AF234887 Hs.57652 


cadherin, EGF LAG seven-pass G-type race 


5.5 


145 


27 


2.2 


100449 D87470 Hs.75400 


KIAA0280 protein 


3.4 


34 


1 


1.2 


100522 X51501 Hs.99949 


prolactin-induced protein 


22.7 


760 


34 


1.4 


100552 AA019521 Hs.301946 


lysosomal 


14.4 


144 


9 


4.7 


100643 NM_005032Hs.4114 


plastin3{Tisoform) 


4.1 


259 


63 


1.9 


100661 BE623001 Hs.132748 


Homo sapiens ribosomal protein L39 mRNA, 


3.3 


116 


36 


2.2 


100666 L05424 Hs.169610 


CD44 antigen (homing function and Indian 


8.5 


85 


1 


3.2 


100667 L05424 Hs,169610 


CD44 antigen (homing function and Indian 


3 


594 


201 


2.3 


100745 BE207168 Hs.144630 


nuclear receptor subfamily 2, group F, m 


5 


82 


17 


0.9 


100774 J05581 Hs.89603 


mucin 1, transmembrane 


3.5 


37 


11 


2.8 


100783 AF078847 Hs.191356 


general transcription factor IIH, polype 


9.7 


97 


10 


7.2 


100821 M26460 


gb:Homo sapiens (clone 104) retlnobiasto 


3.3 


33 


1 


0.8 


100864 BE563957 Hs.74861 


activated RNA polymerase II transcriptio 


3.7 


477 


130 


3.1 


100877 X80821 Hs.27973 


KIAA0874 protein 


6.3 


63 


4 


5.7 


100892 BE245294 Hs.180789 


S164 protein 


4.7 


47 


1 


4.2 


101038 BE297139 Hs.79411 


replication protein A2 (32kD) 


3.8 


115 


30 


7.1 


101046 K01160 


NlvL002122:Homo sapiens major histc)cornpat 


3,9 


390 


100 


11.1 


101079 BE264901 Hs.250502 


carbonic anhydrase VIH 


3.9 


39 


8 


3.6 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



101084 AW409934 Hs.75528 


nucleolar GTPase 


4.1 


53 


13 


4 


101104 AWB62258 Hs.169266 


neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 


101185 NNL001621HS.170087 


ary! hydrocaroon receptor 


11.3 


113 


8 


3.9 


101188 L20320 Hs.184298 


cycfin-dependent kinase 7 (homotog of Xe 


3.1 


118 


38 


2 


101201 L22524 Hs.2256 


matrix metatoprotelnase 7 (MMP7; uterin 


8.2 


396 


46 


0.9 


101232 AU077288 Hs.242894 


ADP-ribosytation factor-like 1 


4 


110 


28 


10.7 


101275 BE545277 Hs.3273 


Ts translation elongation factor, mitoch 


4.2 


50 


12 


4.4 


101300 BE535511 Hs.74137 


transmembrane trafficking protein 


6.6 


135 


21 


13.1 


101396 BE267931 Hs.78996 


proliferating cell nuclear antigen 


6.4 


249 


39 


22.4 


101447 M21305 


gbiHuman alpha satellite and satellite 3 


6.5 


878 


135 


0.8 


101448 NNL000424Hs.195850 


keratin 5 (epidermolysis bullosa simplex 


•4.8 


622 


130 


0.7 


101470 NMJJ00546H3.1846 


tumor protein p53 (U-FraumenI syndrome) 


5.1 


97 


19 


9.3 


101478 NMJ)02890Hs.758 


RAS p21 protein activator (GTPase acfiva 


9.6 


96 


1 


8.5 


101464 AA053486 Hs.20315 


toterferon-lnduced protein with tetratri 


11.2 


112 


8 


5.9 


101507 X16896 Hs.82112 


interieuWn 1 receptor, type I 


3.9 


39 


2 


3.5 


101621 BE391804 Hs.62661 


guanytate binding protein 1, interferon- 


3.6 


36 


1 


2.6 


101624 M55998 


gb:Human alpha-1 collagen type 1 gene, 3 


3.1 


2898 


923 


2,2 


101664 AA436989 Hs.121017 


H2A hlstone family, memberA 


6.9 


103 


15 


8.4 


101684 M63256 Hs.75124 


cerebellar degeneration-related protein 


6.4 


64 


2 


4.9 


101724 L11690 Hs.620 


bullous pemphigoid antigen 1 (2307240kD) 


9.4 


94 


1 


0.3 


101754 S70114 Hs.239489 


TTA1 cytotoxic granule-associated RNA-W 


8.9 


89 


5 


8 


101767 M81057 Hs.180884 


carboxypepSdase B1 (tissue) 


3.6 


824 


227 


1.4 


101791 M83822 Hs.62354 


ceil division cycle 44ike 


9 


144 


16 


13 


101794 M84605 Hs.957 


putative opioid receptor, neuromedin K ( 


3.3 


36 


11 


2.4 


101803 AW024390 Hs.155691 


pre-B-celi leukemia transcription factor 


5.4 


180 


34 


15.9 


101809 M86849 Hs.323733 


gap junction protein, beta 2, 26kO (conn 


12 


120 


8 


9 


101839 AA446644 Hs.692 


GA733-2 antigen; epithelial glycoprotein 


3.1 


353 


116 


2.8 


101888 AL049610 Hs.95243 


transcription elongation factor A (Sll> 


7.3 


73 


1 


5.3 


101960 AL036287 Hs.194662 


calponin 3, acidic 


3.8 


399 


105 


3.3 


102009 BE245149 Hs.82643 


protein tyrosine kinase 9 


4.6 


148 


32 


11-3 


102095 U11313 Hs.75760 


sterol carrier protein 2 


9.5 


95 


4 


8.8 


102123 NMJX)1809Hs.1594 


centromere protein A (17kD) 


4.2 


42 


7 


3.4 


102125 NM_005456Hs.288215 


sialyltransferase 


9.3 


93 


4 


3 


102139 NWL004419Hs.2128 


dual specificity phosphatase 5 


5.4 


137 


26 


2.5 


102162 AA450274 Hs.1592 


CDC 16 (ceil division cycle 16, S. cerevl 
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cytokine receptor-Eke molecule 9 


3.4 


34 


2 


3 


4 *i AA^ A 1 IAAAAJ t 1 — A iff 1 /* 

132316 U28831 Hs.44566 


Lift a i ini j - 

KIAA1641 protein 


18.6 


186 


10 


1.5 


132325 N37065 Hs.44856 


hypothetical protein FU12116 


5.5 


323 


59 


105 


132358 NM_003542Hs.46423 


H4 lustone family, member G 


3.3 


979 


298 


2.2 


132364 AA312135 Hs.46967 


HSPC034 protein 


3.6 


36 


1 


3.1 


132388 W32624 Hs.278626 


Ara/AbHnteracting protein ArgBP2 


5.9 


186 


32 


3.7 


132393 AL135094 Hs.47334 


hypothetical protein RJ14495 


4.2 


159 


38 


7.1 


132407 BE613126 Hs.47783 


B aggressive lymphoma gene 


4.6 


46 


1 


4.3 


40A,JAe IIO^PiA IK JAf AA? 

132425 N87549 Hs.1 25287 


zinc finger protein ZNF140-Cke protein 


3.6 


146 


41 


1.1 


132440 AB020699 Hs. 112751 


\f% H | AO A A A*—. * 

WAA0892 protein 


3.3 


33 


4 


2.9 


132465 AW1 69847 Hs.49169 


WAA1634 protein 


8.3 


145 


18 


3.7 


132522 AB023164 Hs.5070 


KIAA0947 protein 


4.6 


46 


1 


4.4 


132528 T78736 Hs.50758 


SMC4 (structural maintenance of chromoso 


9.3 


93 


1 


8.4 


132530 AA306105 Hs.50785 


SEC22, vesicle trafficking protein (S. c 


4.9 


49 


1 


4.4 


132543 BE568452 Hs.5101 


protein regulator of cytokinesis 1 


11.8 


201 


17 


19.1 


132572 AI929659 Hs.237825 


signal recognition particle 72kD 


3.8 


38 


1 


3 


132592 AW803564 Hs.288850 


Homo sapiens cDNA: FU22528 fis, clone H 


4.8 


93 


20 


3.1 


132602 AW606927 Hs.5306 


hypoftetical protein OKFZp586F1 122 simii 


6.1 


61 


2 


5.9 


132616 BE262677 Hs.283558 


hypoftetfcal protein PR01855 


3.4 


193 


58 


1Z3 


132617 AF037335 Hs.5338 


carbonic anhydrase Xil 


14.2 


390 


28 


22.5 


132618 AL050025 Hs.279916 


hypothetical protein FU20151 


3.3 


909 


274 


3.2 


132632 AU076916 Hs.5398 


guanine monphosphate synthetase 


5 


50 


1 


4.1 


132668 AB018319 Hs.5460 


KIAA0776 protein 


4.2 


171 


41 


12.6 


132742 AA025480 Hs.292812 


ESTs, Weakly similar to T33468 hypotheti 


6.5 


65 


1 


5.6 


132790 AW242243 Hs.168670 


peroxisomal famesylated protein 


3,7 


37 


1 


2.2 


132811- U25435 Hs.57419 


(XXTC-blnding factor (zinc finger protei 


7 


115 


17 


5.4 


132852 AL120050 Hs.58220 


Homo sapiens cONA: FU23005 fis, clone L 


3.3 


61 


19 


5.1 


132856 NM_001448Hs.58367 


glypican4 


4.8 


48 


1 


3.6 


132880 BE077155 Hs.177537 


hypothetical protein DKFZp761B1514 


12.6 


126 


8 


9.9 


132902 A1936442 Hs,59838 


hypothetical protein FU10808 


11 


187 


17 


10.4 


132906 BE613337 Hs.234898 


geminin 


3,3 


106 


33 


2.6 


132914 AL047045 Hs.60293 


Homo sapiens clone 122482 unknown mRNA 


3,5 


110 


32 


2.1 


4^<\A^A A PAA jI ^ AA U- ajaaA 

132968 AF234532 ns.61638 


myosin X 


4.1 


62 


15 


4.9 


132977 AA093322 Hs.301404 


RNA binding motif protein 3 


22.1 


221 


9 


17.8 


4 f\AAAn \r»^rt |A 1 AAJAAJ 

132990 X77343 Hs.334334 


transcription factor AP-2 alpha (activat 


12.7 


311 


25 


2.4 


J\ AAAA * A A J 4 AT J A ll /nnAAf 

132994 M1 12748 Hs.279905 


t 1 lAAAiA AH /"\ AA J A A 

clone HQ0310 PRO0310p1 


3 


380 


127 


5.5 


133011 NwL006379Hs. 171921 


sema domain, Immunoglobulin domain (ig), 


7.3 


271 


37 


22 


133015 AJ002744 Hs.246315 


UDP-N-acetyt-alpna-D-gabctosamlne:poIyp 


4.6 


427 


93 


10.4 


133070 U92649 Hs.64311 


a dtsintegrin and metaBoproteinase doma 


3.6 


36 


1 


3.1 


133091 AKD01628 HS.64691 


ixi a Anjnn ...1.1,, 

KIAA0483 protein 


5.2 


117 


23 


5 


133192 AA218564 Hs.67052 


vacuolar protein sorting 26 (yeast homol 


3.1 


359 


118 


2.5 


133197 AI275243 Hs.180201 


hypothetical protein FU20671 


5.1 


58 


12 


5.7 


133199 AF231981 Hs.250175 


homolog of yeast long chain pofyunsatura 


3 


816 


275 


3.9 


133221 W32474 Hs.301746 


RAP2A, member of RAS oncogene family 


3.1 


234 


76 


8.6 


133240 AK001489 Hs^42894 


ADP-rfbosylafon factor-like 1 


8.1 


81 


1 


4,6 


133271 Z48633 Hs.283742 


H-sapiens mRNA far retrotransposon 


114 


124 


6 


10.8 


133291 BE297855 Hs.69855 


NRAS-relaled gene 


3.3 


33 


1 


2.9 


133294 AJ0O1388 Hs.69997 


zinc finger protein 238 


7.9 


234 


30 


18.9 


49.QQCA AMOOOIA Ltr> "74 im 

13335U Al499Zzu HS.71573 


hypothetical protein pljiuo/4 


4.6 


AR 

40 


5 


Q C 
J.0 


133362 AK001519 Hs.7194 


CG1-74 protein 


5 


110 


22 


9.7 


133370 AF245505 Hs.72157 


DKFZP564I1922 protein 


3.2 


725 


227 


3.2 


133407 AF017987 Hs.7306 


secreted frizzied-related protein 1 


4.1 


374 


91 


1.1 


133422 AB033061 Hs.73287 


KIAA1235 protein 


4.3 


43 


1 


3.9 


133435 AI929357 Hs.323966 


Homo sapiens clone H63 unknown mRNA 


5.5 


186 


34 


165 


133479 W01556 Hs.238797 


ESTs, Moderately similar to 138022 hypot 


3.5 


35 


7 


2.1 


133493 AW998046 Hs.194369 


arglnine-glutamic acid di peptide (RE) re 


3.6 


39 


11 


0.4 


133504 NM.004415HS.74316 


desmopiakin(DPI,DPII) 


4.1 


640 


158 


3 


133517 NM_000165Hs.74471 


gap junction protein, alpha 1, 43kO (con 


3.2 


351 


111 


5.2 


133536 W25797 Hs.177486 


amyloid beta (A4) precursor protein (pro 


3.2 


226 


71 


2,8 


133578 AU077050 Hs.75066 


transOn 


3.4 


178 


53 


8.8 


133633 D21262 Hs.75337 


nucleolar and coGed-body phosphproteln 


4.7 


47 


1 


4 


133640 AW246428 Hs.75355 


ubiquitin-conjugatlng enzyme E2N (homolo 


8.5 


85 


1 


12 


133669 NMJ)06925Hs.166975 


splicing factor, arglnine/serine-rich 5 


3.6 


36 


1 


0.4 


133681 AJ35255B Hs75544 


tyrosine 3-monooxygenase/oyplophan 5-mo 


3.4 


234 


68 


10.7 


133746 AW410035 Hs.75862 


MAD (mothers against decapenteptegic, Or 


9.3 


93 


1 


7.8 
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Hs.8078 
Hs.278614 
Hs.81057 
Hs.8117 



133765 M62194 Hs.75929 
133780 M557660 Hs.76152 
133784 BE622743 Hs.301064 
133814 NM_002462Hs.76391 
5 133829 AW630088 Hs.76550 
133845 AA147028 Hs.76704 
133913 AU076964 Hs.7753 
133968 AA355986 Hs.232068 
133990 R48316 Hs.7822 
10 133999 AA535244 Hs.78305 
134032 NM_0O5O25Hs.78589 
134054 AF091622 Hs.78893 
134087 U51166 Hs.173824 
134089 R51273 Hs.79029 
15 134095 NMJJ04354HS.79069 
134098 BE513171 Hs.79086 
134110 U41060 Hs.79136 
134125 NR.014781HS.50421 
134246 D28459 Hs.80612 

20 134257 C05768 
134272 X76040 
134282 R45621 
134288 AI022650 
134321 BE538082 Hs.8172 

25 134326 AW903838 Hs.81800 
134328 AW959281 Hs.8184 
134348 AW291946 Hs.82065 
134359 NM_001982Hs.199067 
134367 AA339449 Hs.82285 

30 134374 N22687 Hs.8236 
134380 AU077143 Hs.179565 
134395 AA456539 Hs.8262 
134401 AI916662 Hs.211577 
134405 AW057903 Hs.82772 

35 134415 AI750762 Hs.82911 
134417 NAL0G6416HS.82921 
134419 W95642 Hs.82961 
134421 AU077196 Hs.82985 
134436 U29344 Hs.83190 

40 134485 X82153 Hs.83942 
134487 AF061739 Hs.83954 
134495 D63477 Hs.84087 
134520 BE091005 Hs.74861 
134542 M14156 Hs.85112 

45 134570 U66615 Hs.172280 
134590 AW903849 Hs.173840 
134604 NMJ)02884Hs.865 
134612 AW068223 Hs.171581 
134643 AW299723 Hs.87223 

50 134654 AK001741 Hs.8739 
134656 AI750878 Hs.87409 
134672 AF271212 Hs.322901 
134700 AK000606 Hs.8868 
134711 X04011 Hs.88974 

55 134722 AF129536 Hs.284226 
134856 BE281128 Hs.9030 
134880 AI879195 
134917 X87241 
134921 AL137491 

60 134982 AK002085 Hs.92308 
134989 AW968058 Hs.92381 
135029 H58818 Hs,187579 
135035 AL034344 Hs.284186 
135051 AI272141 Hs.83484 

65 135062 AK000967 Hs.93872 
135073 W55956 Hs.94030 
135098 AW274526 Hs.277721 



Hs.90606 

Hs.166994 

Hs.125511 



cadherin 11, type 2, OB-cadherin (osteob 


3.2 


560 


174 


2.6 


deco/in 


5.4 


144 


27 


13.3 


arfapfki 1 


4.7 


47 


1 


4.1 


myxovlnjs (Influenza) resistance 1, homo 


3.3 


380 


114 


4.9 


Homo sapiens mRNA; cONA OKFZp564B1264 (f 6.7 


304 


46 


7.8 


ESTs 


6.2 


600 


97 


4.1 


catumenin 


3.3 


889 


267 


5 


transcription factors (represses Inter* 


3.7 


91 


25 


2.6 


Homo sapiens mRNA; cONA DKFZp564C1216 (f 3.4 


91 


27 


8.5 


RAB2, member RAS oncogene family 


7.8 


78 


1 


5.6 


serine (or cysteine) proteinase inhibito 


5.9 


59 


1 


3.3 


KIAA0244 protein 


5.8 


58 


1 


4.9 


thymine-DNA giycosylase 


6.4 


100 


16 


4.4 


ESTs 


5.1 


51 


9 


3.8 


cydtn G2 


5 


SO 




o.c 


mitochondrial ribosoma) protein L3 


4.8 


246 


51 


3.9 


UV-1 protein, estrogen regulated 


4.5 


1472 


330 


2.1 


K1AA0203 gene product 


4.6 


69 


15 


5.8 


ubiquitin-conjugatlng enzyme E2A (RAD6 h 


7 


97 


14 


7.5 


Homo sapiens done FBD3 Cri-du-chat crit 


3.4 


34 


5 


2.6 


protease, serine, 15 


3.6 


36 


1 


2.8 


hypothetical protein MGC2718 


6.7 


67 


9 


5.7 


erbb2-interacting protein ERBiN 


4.5 


137 


31 


12 


ESTs, Moderately similar to A46010 X-iin 


5.2 


52 


1 


4.9 


chondral*) sulfate proteoglycan 2 (vers 


8.6 


568 


66 


22.4 


ESTs 


4.8 


53 


11 


3.7 


interleukin 6 signal transducer (gp130, 


7.1 


71 


4 


6.4 


v-erW)2 avian erythroblastic leukemia v 


3 


68 


23 


2.8 


phosphoribosylglycinamlde formyitransfer 


4.4 


44 


1 


4.1 


ESTs 


13.3 


445 


34 


6 


mlntehromosome maintenance deficient (S. 
lysosomal 


4.5 
6 


45 

60 


2 
5 


3.4 
5.9 


kinectin 1 (Wnesin receptor) 


4.1 


301 


73 


6.1 


collagen, type X), alpha 1 


4.6 


1216 


267 


4.4 


protein tyrosine phosphatase type fVA, m 


4.9 


163 


34 


15.1 


solute carder family 35 (CMP-sialic act 


4.9 


49 


3 


3.8 


trefoil factor 3 (intestinal) 


3.2 


1872 


592 


3.3 


collagen, type V, alpha 2 


6.3 


1075 


171 


3.8 


fatty acid synthase 


3.3 


710 


217 


2 


cathepsin K (pycnodysostosis) 


34.3 


411 


12 


5.1 


protein associated with PRK1 


4.8 


153 


32 


4.3 


WAA0143 protein 


3.1 


147 


48 


12.7 


activated RNA polymerase II transcripfjo 


3.3 


33 


1 


2 


insunn-Tike growth factor 1 (somatomedl 


4.2 


42 


5 


2.6 


SWl/SNF related, matrix associated, acti 


3.9 


39 


1 


2.5 


HUEL (C4orf1)-interacting protein 


3.7 


41 


11 


0.6 


RAP1A, member of RAS oncogene family 


5.2 


52 


1 


3 


ubkptin C-terminal hydrolase UCH37 


4.9 


49 


1 


3.7 


bone morphogenefic protein receptor, fyp 


5.2 


52 


5 


3.5 


hypothetical protein FU10879 


6.4 


64 


1 


5.1 


thrombospondin 1 


12.6 


126 


1 


10.8 


disrupter of silencing 10 


5.4 


81 


15 


2.6 


golgl SNAP receptor complex member 1 


3.4 


179 


52 


1.5 


cytochrome b-245, beta polypeptide (chro 


3.2 


143 


45 


13.9 


F-box only protein 6 


7 


70 


6 


6 


TONDli 


3.1 


31 


1 


2.3 


15 kDa selenoprotein 


5.7 


57 


1 


5 


FAT tumor suppressor (DrosophBa) homoJo 


3.2 


153 


48 


4.7 


Homo sapiens mRNA; cDNA DKFZp434P1530 (f 4 


452 


114 


2 


Homo sapiens cDNA FU1 1223 fis, clone PL 


5.1 


150 


30 


7.2 


nudix (nucleoside diphosphate linked moi 


8.2 


114 


14 


9.9 


hydroxystenofd (17-beta) dehydrogenase 7 


11.5 


115 


1 


10 


forkheadboxCI 


5.4 


259 


48 


1.4 


SRY (sex determining region Y)-box 4 


3.3 


1296 


394 


2.2 


KIAA1682 protein 


3.8 


240 


64 


3.2 


Homo sapiens mRNA: cDNA OKFZp586E1624 (f 8.1 


101 


13 


7.9 


ovarian carcinoma antigen CA125 


3.3 


33 


1 


2.6 
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i 



135117 V&2493 


Hs.94694 


Homo sapiens cDNA FU10561 fis. clone NT 


5.3 


53 


1 


4.1 




S 1J5144 NM_016255Hs.95260 


Autosomal Highly Conserved Protein 


7.4 


74 


5 


2.4 




135154 AK001835 Hs.267812 


sorting nexin 4 


6.6 


69 


11 


6.3 




135155 AI207958 


Hs.166556 


Homo sapiens, Similar to TEA domain fami 


6.1 


61 


1 


5.1 




135172 AB028956 


Hs.12144 


KIAA1033 protein 


3.4 


88 


26 


1.4 




135242 AI583187 


Hs.9700 


cycfinEI 


3.1 


31 


1 


2.3 




135243 BE463721 


Hs.97101 


putative G protein-coupled receptor 


3.4 


169 


50 


9.1 




135269 NMJJ03403HS.97496 


YY1 transcription factor 


3.4 


475 


142 


2.5 




135356 BE312948 


Hs.18104 


hypoMcal protein FU11274 


3.1 


31 


10 


1.7 




135357 AJ565004 


Hs.79572 


cathepsin D (lysosomal aspartyl protease 


4.7 


710 


151 


25 




135389 U05237 


Hs.99872 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 




135397 L14922 


Hs.166563 


replication factor C (activator 1) 1 (14 


3.2 


32 


1 


2.4 




-135400 X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 




AW71525 


HJL247486 


ESTs 


3.8 


58 


16 


5.5 




X70683 


Hs.93668 


ESTs 


1.8 


1047 


596 


16 




L14922 


Hs.82128 


5T 4 oncofetal trophoblast glycoprotein 


5 


285 


58 


1.2 




M23263 


Hs.904 


amyto-1;6-glucosidase; 4^Jpha-g!ucanotransferas 


3.1 


31 


1 


2.6 


A1267886 


Hs.148027 


polymerase (RNA) II (DMA directed) polypeptide 8 


7.8 


137 


18 


11.9 


AA044840 


Hs^41676 


stromal cefl-derived {actor 1 


4.7 


114 


25 


0.9 




N90960 


Hs^27459 


ESTs; Moderately similar to Ml ALU SUBFAMILY 


4.7 


151 


32 


9.3 


AA873285 


Hs.137947 


ESTs 


4.7 


47 


3 


4.4 




T56679 


Hs.065 


RAP1A; member of RAS oncogene family 


4 


40 


1 


3.4 




AA305536 




"EST176522 Colon carcinoma (Caco-2) cell line II 


3.6 


121 


34 


118 


AI369384 




arylsulfatase 0 


3.5 


113 


33 


1.7 




AA219081 


Hs.242396 


ESTs; Moderately similar to ill! ALU SUBFAMILY] 


3.4 


107 


32 


9.9 
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TABLE 10A 



Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences ware clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681J AA602964 AA6092QO 

20 104602 524482 J H47610R86920 

121581 283769J AA416568AA442889 M4 17233 AA442223 

123523 genbank_AA6Q8588 AA608588 

100821 tigr_HT43Q6 M26460 U09116 

125091 genbankJ91518 T91518 
25 125150 NOTJ : OUND^entreiLW38240 W38240 

118475 genbankj!66845 N66845 

104787 genbankJ\A027317 AA027317 

106055 genfcmLAA417034 AA417034 

113702 genbankJ07307 T97307 
30 101046 entre?J<01160 K01160 

101447 entre2LM21305 M21305 

101624 entre^M55998 M55998 

124677 genbankj*01073 R01073 

110581 genbank_H61560 H61560 
35 119023 genbanK_N98488 N98488 

110775 genbanU*22414 N22414 

112092 genbankjl44538 R44538 

112253 genbanR_R51818 R51818 

107014 genbankJ\A598820 AA598820 
40 114988 genbanMA251089 AA251089 
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TABLE 11: Figure 11 from BRCA 001-3 PCT 

5 Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unkjene number 

Unigene Title: Umgene gene title 

R1: Ratto of tumor to normal body tissue 

R2: Ratio of 90* percentile tumor to normal body 

15 R3: Ratio of 75* percentile norma! body to tumor 

R4: Ratio of tumor to normal breast tissue 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn UnigenelD Unlgene Title 


R1 


R2 


R3 


R4 


100131 D124S5 Hs.11951 ectonudeotidepyrophosphatase/phosphodi 


m 


244 


19 


9.9 


100147 013666 Hs.13634S osteoblast specific factor 2 (fascidin 


15.7 


1030 66 


5 


100522 X51501 Hs.99949 prolactMnduced protein 


22.7 


760 


34 


1.4 


100666 L05424 Hs. 169610 CD44 antigen (homing function and Indian 


8.5 


85 


1 


3.2 


101104 AW862258 Hs.169266 neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 


101478 NM.002890Hs.758 RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


101724 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 


9.4 


94 


1 


0.3 


101754 S70114 Hs.239489 T1A1 cytotoxic granule-assc<fcted RNA-bi 


8.9 


89 


5 


8 


101888 AL049610 Hs.95243 transcription elongation factor A (SI \y 


7.3 


73 


1 


5.3 


102165 BE313280 Hs.159627 death associated protein 3 


9.3 


93 


5 


8 


102304 AF015224 Hs.46452 mammaglobin 1 


8.5 


2058 243 


1.4 


102348 U37519 Hs.87539 aldehyde dehydrogenase 3 tamliy, member 


6.4 


428 


67 


2,3 


102457 NM.001394Ks.2359 dual specificity phosphatase 4 


20.2 


202 


5 


1.3 


102567 U63830 Hs.146847 TRAF family member-associated NFKB activ 


8.2 


82 


1 


6.8 


102823 D85390 Hs.5057 carboxypeptidase D 


5.6 


56 


1 


5.3 


103557 AL133415 Hs.297753 vimentin 


7.5 


136 


18 


3.4 


103613 NMJ»Q346Hs.2316 SRY (sex determining region Y)-box 9 (ca 


7.3 


73 


1 


5.2 


104115 AF183810 Hs.26102 opposite strand to trlchomlnophalangeal 


29 


290 


1 


26.8 


104667 A1239923 Hs.30098 ESTs 


14.9 


149 


1 


6.4 


104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapien 


7.7 


77 


1 


5.1 


104807 AJ139058 Hs.125790 leucine-rich repeat-containing 2 


7 


70 


1 


6.5 


104896 AW015318 Hs.23165 ESTs 


7.4 


74 


1 


6 


104943 AF072873 Hs.1 14218 frbzied (Drosophila) homoiog 6 


16.2 


162 


1 


4.2 


105038 AW503733 Hs.9414 WAA1488 protein 


5.5 


55 


1 


5.2 


105329 AA234561 Hs.22862 ESTs 


ZB 


131 


47 


3.9 


105500 AW602166 Hs.222399 CEGP1 protein 


25.4 


508 


20 


3 


105516 AK001269 Hs.30738 hypofteficaJ protein FU10407 


8.3 


83 


3 


1.8 


105730 AW377314 Hs.5364 DKF2P564I052 protein 


6.9 


69 


1 


4.4 


106012 A1240665 Hs.8895 ESTs 


21.2 


212 


6 


17.4 


106095 AF1 15402 Hs.1 1713 E744ke factor 5 (ets domain transcript 


26.3 


356 


14 


1 


106155 AA425414 Hs.33287 nuclear factor 1/8 


9.9 


483 


49 


1.8 


107102 AB037765 Hs.30652 K1AA1344 protein 


6.3 


63 


1 


5.4 


107136 AV661958 Hs.8207 GK001 protein 


2.5 


392 


155 


4.3 


107151 AW378065 Hs.8687 ESTs 


15.6 


156 


7 


10.8 


107922 BE153855 Hs.61460 Ig superfamHy receptor LNIR 


9 


90 


1 


55 


108339 AW151340 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S 


18.7 


187 


1 


17 


109112 AW419196 Hs.257924 hypoftelical protein FU 13782 


4.1 


334 


82 


3.4 


109292 AW975746 Hs.188662 K1AA1 702 protein 


7.1 


71 


1 


6.5 


109415 U80736 Hs.1 10826 trfnudeoSde repeat containing 9 


12.3 


123 


1 


113 


103912 AW390822 Hs.301528 L-kynurer^e/alpr^-anTinoadipate aminotra 


14.2 


142 


1 


9.5 


110009 BE075297 Hs.6614 ESTs, Weakly simiiar to A43932 mucin 2 p 


6.3 


693 


110 


12 


110915 8E092285 Hs.29724 hypofoetical protein FU13187 


20.9 


209 


1 


195 


111164 N46180 Hs.122489 Homo sapiens cONA FU13289 fo, clone OV 


7.7 


77 


1 


5 
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111179 AK000136 Hs.10760 asporin (LRR class 1) 


25.1 


288 


12 


6.7 


111190 AKD02055 Hs.151046 hypothetical protein RJ 11 193 


6,3 


63 


1 


5.8 


111223 AA852773 Hs.334638 KIAA1866 protein 


3.6 


402 


112 


4.9 


111357 BE314949 Hs.87126 hypothetical protein RJ23309 


3.8 


425 


111 


4 


112244 AB029000 Hs.70823 KIAA1077 protein 


5.7 


567 


100 


6.7 


113047 AI571940 Hs.7549 ESTs 


9.6 


124 


13 


9 


113702 T97307 gb:ye53h05.s1 Soares fey liver spteen 


113 


129 


11 


11.7 


114124 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 


24.2 


242 


10 


5.6 


114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 6.7 


67 


1 


6.3 


114768 AF212848 Hs.182339 ets homologous factor 


13.7 


137 


1 


8.9 


114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 


9.1 


91 


1 


7.6 


114965 AI733881 HsJ2472 BMP-R1B 


35.9 


359 


10 


29.7 


1 14988 AA251089 gb:zsO4f05.s1 NCI_CGAPJ3CB1 Homo sapiens 1 1 .5 


115 


1 


6.9 


115206 AW183695 Hs.186572 ESTs 


5.8 


58 


1 


5 


115719 AW992405 Hs.59622 Homo sapiens, clone IMAGE:3507281, mRNA, 


7.6 


144 


19 


13.9 


115844 AI373062 Hs.332938 hypothetical protein MGC5370 


6.2 ■ 


62 


1 


5.4 


116470 AI272141 Hs.83484 SRY (sex determining region Y)-bax 4 


1.8 


1047 596 


1.6 


116786 H25836 Hs.301527 ESTs, Moderately simllarto unknown (H.s 


22.8 


228 


9 


12.4 


117280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 Us, done C 


3.9 


322 


83 


4.4 


117412 N32536 Hs.42645 solute carrier family 16 (monocarboxync 


17.4 


174 


9 


6.9 


118472 AL157545 Hs.42179 bromodomafo and PHD Anger containing, 3 


14.5 


145 


1 


2.4 


119271 AI061118 Hs.65328 Fanconi anemia, complementation group F 


8.2 


82 


1 


6.4 


119771 AI905687 Hs.2533 EST 


3.5 


2072 


I 595 


21 


120562 BE244580 Hs.302267 hypothetical protein FU10330 


8.5 


127 


15 


1.6 


121463 AK000282 Hs.239681 hypothetical protein FU20275 


10.3 


103 


1 


9.3 


121723 AA243499 Hs.104800 hypothetical protein FU10134 


2.9 


214 


74 


3.7 


122963 AA478446 Hs.69559 KIAA1096 protein 


7.2 


72 


1 


5.7 


123137 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 


9.9 


351 


36 


13.9 


123619 AA602964 gb:no97c02.s1 NCl_CGAPJPr2 Homo sapiens 


8.5 


85 


1 


4.3 


123709 AA706910 Hs.1 12742 ESTs 


3.9 


60 


16 


4.8 


124006 AI147155 Hs.270016 ESTs 


5.8 


321 


55 


17 


124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothec 


10.4 


880 


85 


5.3 


124308 AA249027 Hs.241507 ribosomal protein S6 


10.5 


105 


1 


9.9 


125279 AW401809 Hs.4779 K1AA1 150 protein 


13.1 


131 


1 


5.1 


125517 AA287921 Hs.164950 ESTs 


6.7 


67 


1 


6 


127439 D60237 Hs.14368 SH3dornainblrrfingglutarnlcado^ichpr 


30.6 


306 


4 


26.5 


128305 AI954968 Hs.279009 matrix Gla protein 


7.5 


75 


1 


6.5 


128482 A1694143 Hs.296251 programmed cell death 4 


7.2 


72' 


1 


5.8 


128790 AF026692 Hs.105700 secreted frizrfea^related protein 4 


17.4 


409 


24 


7.8 


128925 R67419 Hs.21851 Homo sapiens cONA FU129QQ lis, clone NT 


7.1 


392 


56 


3.6 


129017 AA115333 Hs.107968 ESTs 


8.2 


82 


1 


7.4 


129229 AF013758 Hs.109643 polyadenylate binding protein-interactln 


7.1 


71 


1 


6.2 


129337 NMJ)14918Hs.110488 K1AA0990 protein 


9.5 


95 


1 


8.5 


129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 


7.1 


150 


21 


14.5 


129821 AB028945 Hs.12696 cortactin SH3 domain-binding protein 


11.4 


114 


1 


10 


130036 BE061916 Hs.125849 chromosome 8 open reading frame 2 


6.7 


67 


1 


5.7 


130057 AF027153 Hs.324787 solute carrier family 5 (Inositol transp 


1 


1 


1 


1 


130095 AKD01635 Hs.14838 hypothetical protein FU10773 


14.6 


219 


15 


7.6 


130343 AB040914 Hs.278628 KIAA1481 protein 


13.2 


331 


25 


12.4 


130385 AW067800 Hs.155223 stanniocalcin 2 


72.2 


722 


1 


1.9 


130407 BE385099 Hs.334727 hypothetical protein MGC301 7 


6.5 


65 


4 


5.3 


130441 U63630 Hs.155637 protein kinase, DNA^activated, catalytic 


6.1 


61 


1 


5.7 


130455 D90041 Hs.155956 N^cetyltransferase 1 (arylamlne Nicety 


10.8 


706 


66 


9.2 


130604 AA383256 Hs.1657 estrogen receptor 1 


32.2 


322 


1 


4.7 


130617 M90516 Hs.1674 glutamine-fnjctose^-phosphatetransarnin 


10 


100 


1 


7.6 


130712 AJ271881 Hs,279762 bromodomaln-containing 7 


175 


175 


2 


12.8 


131148 AW953575 Hs.303125 p53-induced protein PIGPC1 


3.8 


585 


153 


3.7 


131388 NM.014810HS.92200 K1AA0480 gene product 


7.6 


76 


1 


5 


131564 T93500 Hs.28792 Homo sapiens cDNA FU11041 fis, done PL 


4.7 


381 


81 


6.4 


131742 AA961420 Hs.31433 ESTs 


11.7 


117 


1 


10.1 


131877 J04088 Hs.156346 topoisornerase (DNA) II alpha (170kD) 


6.8 


68 


1 


5.6 


131985 AA503020 Hs.36563 hypothetical protein FU22418 


40.2 


402 


1 


4 


132316 U28831 Hs.44566 K1AA1641 protein 


18.6 


186 


10 


1.5 


132528 T78736 Hs.50758 SMC4 (structural maintenance of chromoso 


9.3 


93 


1 


8.4 


132742 AA025480 Hs.292812 ESTs, Weakly similar to T33468 hypothetl 


6.5 


65 


1 


5.6 


132990 X77343 Hs.334334 transcription factor AP-2 alpha (acCvat 


12.7 


311 


25 


24 


133015 AJ002744 Hs.246315 UDP^cetySalpha^aladosamlnerpolyp 


4.6 


427 


93 


10.4 
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133199 AF231981 Hs^50175 homobg of yeast long chain polyunsatura 

133240 AKD01489 Hs.242894 ADP-ribosytation factor-like 1 

133271 Z48633 Hs.283742 H.saplens mRNA for retrotransposon 

133640 AW246428 Hs.75355 ubiquilirwxinjugating enzyme E2N (homoto 

5 133746 AW410035 Hs.75862 MAD (mothers against decapentapteglc, Dr 

133999 AA535244 Hs.78305 RAB2, member RAS oncogene family 

134110 U41060 Hs.79136 LIV-1 protein, estrogen regulated 

134485 X82153 Hs.83942 cathepsln K (pycnodysostosls) 

134654 AK001741 Hs.8739 hypothetical protein FU10879 

10 134880 A1879195 Hs.90606 1 5 kDa selenoprotein 

135029 H58818 Hs. 187579 hydroxysteroid (17-beta) dehydrogenase 7 

135389 U05237 Hs.99872 fetal Alzheimer antigen 

128305 A1954968 Hs.279009 matrix Gla protein 



3 


816 


275 


3.9 


8.1 


81 


1 


4.6 


12.4 


124 


6 


10.8 


8.5 


85 


1 


72 


9.3 


93 


1 


7.8 


7.8 


78 


1 


5.6 


4.5 


1472 330 


2.1 


34.3 


411 


12 


5.1 


6.4 


64 


1 


5.1 


5.7 


57 


1 


5 


115 


115 


1 


10 


20.6 


206 


4 


19.1 


9.4 


94 


3 


5.3 
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TABLE 11A 



Table 11A shows the accession numbers for those pkeys lacking unigenelD's for Table 1 L 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



CAT number Gens duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681J AA602964 AA609200 

20 113702 genbankJT97307 T973Q7 

114988 genbank^AA251089 AA251089 



10 



Pkey: 



Unique Eos probeset identifier number 
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TABLE 12: Figure 12 from BRCA 001-3 PCT 



5 Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey. Unique Eos probeset kJemlfier number 

1 0 ExAocn: Exemplar Accession number, Genbank accession number 

UnigenelO: Unigene number 

Unlgene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90* percentile tumor to body 

1 S R3: Ratio of 75°> percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 

Pkey ExAccn UnigenelD Unlgene Title 

20 100131 012485 Hs.11951 phosphodiesterase I (PC-1) 

105500 AW602166 Hs.222399 ESTs 

112244 AB029000 Hs.70823 K1AA1077 protein 

114124 W57554 Hs.125019 ESTs 

119771 AI905687 Hs.2533 ESTs 

25 121723 AA243499 Hs.104800 ESTs 

128790 AF026692 Hs.105700 secreted frizzied-related protein 4 

131148 AW953575 Hs.303125 ESTs 

131985 AA503020 Hs.36563 ESTs 

133199 AF231981 Hs.250175 Homo sapiens clone 23904 mRNA sequence 3 



R1 


R2 


R3 


R4 


m 


244 


19 


9.9 


25.4 


508 


20 


3 


57 


567 


100 


6.7 


24.2 


242 


10 


5.6 


3.5 


2073 


595 


2.1 


2.9 


214 


74 


3.7 


17.4 


409 


24 


7.8 


3.8 


585 


153 


3.7 


40.2 


402 


1 


4 


3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
5 Table 13 depicts a preferred group of genes upregulated in breast cancer cells. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Pkey: 
ExAccn: 
UnigenelD: 
Unigene Tide: 
R1: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 



Ratio of iumor to normal body tissue 



Pkey ExAccn UniGenelD Unigene Title 



100038 M97935 

100039 M97935 

100040 M97935 

100041 M97935 
100082 AB003103 
100091 AF000177 
100100 AF005084 
100103 AF007875 
100114 000596 
100121 D10495 
100123 D10523 
100128 D11094 
100131 D12485 
100137 D13627 
100144 D13643 
100147 D13666 
100154 D14657 
100164 D14812 
100169 D14878 
100190 D21090 
100203 025538 
100209 D26308 

100215 D26598 

100216 026599 
100219 D28137 
100227 D28915 
100248 031888 
100287 D43950 
100294 D49396 
100307 D50525 
100335 063391 
100340 D63487 
100355 D78129 
100363 D78514 
100368 D79987 
100372 D79997 
100375 D80004 
100379 082060 
100387 D83777 
100393 D84145 
100398 D84557 

100405 D86425 

100406 D86479 
100409 086957 
100421 D86985 

100446 D87464 

100447 D87465 

100448 D87469 



Hs.4295 

Hs.1 11783 

Hs.11538 

Hs.5085 

Hs.62962 

Hs.155342 



Hs.61153 

Hs.11951 

Hs. 15071 

Hs.75616 

Hs.136348 

Hs.81892 

Hs.173714 

Hs.82043 

Hs.178658 

Hs.172199 



Hs.62793 

Hs.1390 

Hs.118110 

Hs.82316 

Hs.78398 

Hs.1600 

Hs.75454 

Hs.699 

Hs.6793 

Hs.82563 

Hs.71465 

Hs.78563 

Hs.153479 

Hs.184339 

Hs.75909 

Hs.278721 

HS75137 

Hs.39913 

Hs.155462 

Hs.82733 

Hs.118397 

Hs.80712 

Hs.79276 

Hs.10037 

Hs.74583 

Hs.57652 



control 
control 
control 
control 

proteasome (prosome; macnopain) 26S sub 
Lsm1 protein 

actln related protein 2/3 complex; subunit 

doiichyi^hosphate mannosyltransferase p 

thymidyiate synthetase 

protein kinase C; delta 

oxoglutarate dehydrogenase (iipoamide) 

proteasome (prosome; macropain) 26S sub 

phosphodiesterase l/nucleotide pyrophosp 

chaperonin containing TCP1; subvm 8 (t 

Human mRNA for K1AA0018 gene; comp 

osteoblast specific factor 2 (fasciciin l-like 

K1AA0101 gene product 

MORF-related gene X 

D123 gene product 

RAD23 (S. cerevisiae) homoiog B 

adenylate cyclase 7 

bliiverdin reductase B (flavin reductase (N 
proteasome (prosome; macropain) subunit 
proteasome (prosome; macropain) subunit 
bone marrow stroma) cell antigen 2 
interferon-induced; hepatitis Oassodated 
WAA0071 protein 

chaperonin containing TCP1; subunit 5 (e 
antioxidant protein 1 
hypothetical protein 

ptatetekactivafjng (actor acetylhydrolase; 
K1AA0153 protein 

Homo sapiens mRNA for squalene epoxid 
u biquitin -conj ugating enzyme E2G 1 (horn 
extra spindle poles; S. cerevisiae; homoio 
KIAA0175 gene product 



Ke4 gene; mouse; human homoiog of 
KIAA0193gene product 
novel RGO-contatnfng protein 
mlnichromosome maintenance deficient (m 
nidogen 2 

AE-bhding protein 1 
K1AA0202 protein 

Human mRNA for KJAA0232 gene; comp 
K1AA0274 gene product 
KIAA0275 gene product 
EGF-Bke-domain; multiple 2 

215 



R1 

16.7 

6.3 

8.3 

14.8 

7.5 

4.9 

4.7 

13.4 

15.9 

4.6 

7.5 

4.4 

8.7 

9.5 

6 

8.5 

10.5 

4.6 

7.9 

5.6 

9.9 

4.9 

14.2 

11-3 

5.7 

5.7 

7.4 

5.6 

12.9 

8,4 

6.8 

4.4 

1Z6 

4.6 

6.5 

8.4 

4.5 

8.1 

10.7 

72 

72 

5.4 

4.3 

11.9 

9.7 

6.4 

10 

6.2 
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100467 D89052 Hs.7476 ATPase; H+ transporting; lysosomal (vacu 7.5 

100458 D89289 Ha.118722 (ucosytoansferase8(alpha(1;6)fucosyltr 5 

100486 HT1112 Hs.10842 Ras-Uxe Protein Tc4 16.9 

100497 HT1400 Hs.79137 Carboxyl Melnyftransferase, Aspartate, A 5,6 

5 100618 HT2710 Hs.1 14599 Collagen. Type V5I, Alpha 1 7.5 

100661 HT3018 Hs.132748 Rfoosomal Protein L39 Homolog 4.4 

100667 HT3127 Hs.169610 Epican, AUSpQcell 4.6 

100668 HT3938 Hs.169610 Epican, AIL Splice 12 4.4 
100676 HT3742 Hs.287820 Fibronedin, All Splice 1 9 

10 100775 HT26388 Hs.89603 Mudn 1. Epithelial, Alt Splice 9 4.7 

100783 HT4018 Hs.1 91 356 Basic Transcription Factor, 44 KdaSubun 13.7 

100829 HT4343 Hs.278544 Cytosolic Acetoacetyl-Coenzyme A Thlo 10.6 

100830 HT4344 Hs.4756 Rad2 5.5 
100840 HT4392 Hs.183418 Protein Kinase Pitslre, Alpha, AIL Splice 4.1 

15 100850 HT417 Hs.297939 CafoepsinB 4 

100866 HT4582 Hs.75113 Transcription Factor Ilia 4.9 

100906 HT5158 Hs.5398 Guanosine 5 , -Monophospha(e Synthase 8.7 

100914 HT511 Hs.324178 Ras Inhibitor Inf 7.2 

100916 HT544 Hs.73946 Endothelial Cell Growth Factor 1 5.9 

20 100945 HT884 Hs.180686 Oncogene E6-Ap, Papillomavirus 4.6 

100975 J02923 Hs.76506 lymphocyte cytosoDc protein 1 (Upiastin 30.1 

100988 J03589 H$mS0 ubiqulM<e4 8.3 

100996 J03909 Hs.14623 interferon; garnma-indutible protein 30 6.9 

100999 J03934 Hs.80706 dlaphorase (NAOH/NAOPH) (cytochrom 4.3 

25 101011 J04430 Hs.1211 acid phosphatase 5; tartrate resistant 5.9 

101017 J04599 Hs.821 blglycan 5.1 

101031 J05070 Hs.151738 matrix metaDoprotemase 9 (gelatinase B; 372 

101038 J05249 Hs.79411 replication protein A2 (32kD) 6.1 

101054 K02405 Hs.73931 Human MHC class II HtA-OQ-beta mRN 4.3 

30 101061 K03515 Hs.180532 glucose phosphate isomerase 4.3 

101091 L06132 Hs. 149155 voltage-dependent anion channel 1 7.4 

101097 L06797 Hs.89414 cherroWne (C-XC motif); receptor 4 (fus 4.6 

101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPY 183 

101143 L12723 Hs.90093 heat shock 70kD protein 4 17.4 

35 101152 L13800 Hs.9884 Homo sapiens Bver expressed protein gen 7.6 

101183 L19779 Hs.795 H2Ahistonelarnliy; member O 10.9 

101216 L25876 Hs.84113 cycHrniependerit kinase inhibitor 3 (COK 7.4 

101233 129008 Hs.878 sorbitol dehydrogenase 14.6 

101247 133801 Hs.78802 glycogen synthase kinase 3 beta 7.5 

40 101282 L38810 Hs,79387 proteasome (prosorne; macropain) 26S sub 4.4 

101326 L42572 Hs.78504 Inner membrane protein; mltccrwndrial{m 5.8 

101332 L47276 Hs.156346 Homo sapiens {cell fine HL-6) alpha topo 18.9 

101348 L77213 Hs.30954 phosphomevalonafB kinase 7.5 

101352 L77701 Hs.16297 COX17 (yeasl) homotog; cytochrome cox 9.3 

45 • 101378 M13755 Hs.833 Werferon^timulated protein; 15 kDa 18.1 

101396 M15796 Hs.78996 proliferating ceD nuclear an^en 8.6 

101404 M16342 Hs.182447 heterogeneous nuclear rlbonucieoprotein C 4.5 

101439 M20902 Hs.268571 apolipoprotein C4 6.1 

101464 M22538 Hs.51299 NADH dehydrogenase (ubiquinone) flavo 8.7 

50 101469 M22877 Hs.169248 Human somatic cytochrome c (HCS) gene 4.2 

101472 M22960 Hs.1 18125' protective protein for beta-galactosldase ( 6.5 

101478 M23379 Hs.758 RAS p21 protein activator (GTPase activa 14 

101484 M24594 Hs.20315 toterferon^uced protein 56 92 

101539 M30818 Hs.926 myxovirus (influenza) resistance 2; homo! 5.1 

55 101540 M30938 Hs.84981 X-ray repair cornplerrtenting deiectrve rep 4.7 

101544 M31169 Human propionyi-CoA carboxylase beta-s 5.5 

101552 M31642 Hs.82314 hypoxanthlne phosphoribosyltransferase 1 8.5 

101580 M34677 Hs.83363 DNA segment on ctoomosome X (unique) 4.5 

101600 M37583 Hs.119192 H2A h'tstone famDy; member Z 5.7 

60 101663 M60750 Hs.2178 H2B histone Iam3y; member A 5.8 

101664 M60752 Hs.121017 H2A histone famfly; member A 135 

101667 M60858 Hs.79110 nudeolin 4 

101684 M63256 Hs.75124 cerebellar degeneration-related protein (62 7.6 

101702 M64929 Hs.179574 protein phosphatase 2 (formerly 2A); regu 4.2 

65 101754 M77142 Hs.239489 TIA1 cytotoxic granule-associated RNA-b 4.5 

101758 M77836 Hs.79217 pymW-5-rarboxyiate reductase 1 5.7 

101767 M81057 Hs.180884 carboxypeptidasa B1 (tissue) 21.7 
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101770 M81601 Hs.78869 transcription elongation factor A (Sli); 1 4.6 

101791 M83822 Hs.62354 cell division cycle 4-flke 9.7 

101803 M86546 Hs.155691 pre-B-cel! leukemia transcription factor 1 5.5 

101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 22,5 

5 101839 M93036 Hs.692 membrane component; chromosomal 4; su 4 

101851 M94250 Hs.82045 midkine (neurite growUvpromoting factor 7.6 

101888 M99701 Hs.95243 transcription elongation (actor A (SWHike 11.4 

101973 S82597 Hs.80120 UDP<N-acetyt-aIpha-0^alactosamine:po 4.6 

101991 U00968 Hs.166 Human SREBP-1 mRNA; oomptete cds 4.1 

10 102009 U02680 Hs.82643 protein tyrosine kinase 9 4.4 

102025 U03911 Hs.78934 mutS (E. co!i) homolog 2 (colon cancer; n 4 

102047 U07158 Hs.83734 syntaxin 4A (placental) 6.1 

102051 U07550 Hs.1197 heat shock 10kD protein 1 (chaperonin 10 4.4 

102083 U10323 Hs.75117 intedeukin enhancer binding factor 2; 45k 10.4 

15 102095 U11313 Hs.75760 sterol carrier protein 2 9.5 

102130 U15009 Hs.1575 small nuclear ribonudeoprotein 03 polyp 6.6 

102133 U15173 Hs.155596 B<X2/adenowusE1B 19kf>feiteractlng p 4.3 

102148 U16954 Hs.75823 ALU-fused gene from chromosome 1q 6.9 

102179 U19713 Hs.76364 allograft irtflarnmatory factor 1 4.8 

20 102180 U19718 Hs.83551 microfibrillar^ssodated protein 2 72 

102193 U20758 Hs.313 secreted phosphoprotein 1 (osteopontin; b 7.2 

102198 1/21090 Hs.74598 pofymerase (DtiA directed); delta 2; regu 4.3 

102202 U21931 Hs.574 fructose-btsphosphatase 1 4.5 

102209 U22970 Hs.265827 Interferon; aipha-hduciWe protein (clone 9.9 

25 102211 U 23070 Hs.78776 putative transmembrane protein 4.9 

102220 U24389 Hs.65436 lysyl oxidase-iike 1 8.5 

102224 U24704 Hs.148495 proteasome (prosome; macropain) 26S sub 5.4 

102234 U26312 Hs.278554 cfiromobox homolog 3 (Drosophfla HP1 g 7.7 

102250 U28014 Hs.74122 caspase 4; apoptosis-related cysteine prot 5.4 

30 102260 U28386 Hs.159557 karyopherin alpha 2 (RAG cohort 1 ; impo 6.3 

102261 U28488 Hs.155935 implement component 3a receptor 1 5.7 

102273 U30688 Hs.75981 ubiquitin specific protease 14 (tRNA-guan 6.1 

102298 U32849 Hs.54483 N-myc (and STAT) interactor 4.1 

102302 U33052 Hs.69171 protein kinase (Mike 2 4.3 

35 102305 U33286 Hs.90073 chromosome segregation 1 (yeast homolo 5.4 

102320 U34683 Hs.82327 glutathione synthetase 4.1 

102330 U35451 Hs.77254 chromobox homolog 1 (Drosophila HP1 b 4 

102348 U37519 Hs.87539 aldehyde dehydrogenase 8 9.4 

102361 U39400 Hs.75859 chrornosorne 11 open reading frame 4 52 

40 102362 U39412 Hs.75932 N-ethylmaleimide-sensitive factor attachm 9.3 

102369 U39840 Hs.299867 hepatocyte nuclear factor 3; alpha 7.7 

102395 U41767 Hs.92208 a disintegrin and metalloprotelnase domal 10.4 

102409 UA32BS Hs.1 18725 seterwpnosphate synthetase 2 6.2 

102418 U43923 Hs.79058 suppressor of Ty (Sxerevislae) 4 homolog 4.1 

45 102425 U44772 Hs.3873 patrnitoyl-protein thloesterase (ceroioMipo 4.8 

102457 U48807 Hs.2359 dual specificity phosphatase 4 6.3 

102465 U49352 Hs.81548 2;4-dienoyl CoA reductase 1; mitochondri 9.4 

102495 U51240 Hs.79356 LysosonraJ-associated multlspanning mem 6.5 

102534 U56833 Hs.198307 von HippeWndau binding protein 1 8.6 

50 102546 U57877 Hs.3577 succinata dehydrogenase complex; subuni 4.3 

102549 U58046 Hs.198899 eukaryofc translation Initiation factor 3; s 6.3 

102557 U58766 Hs.264428 tissue specific transplantation antigen P35 5 

102562 U59309 Hs.75653 fumarate hydratase 6 

102568 U59877 Hs.223025 RAB31; member RAS oncogene family 9.1 

55 102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatid 7.9 

102581 U61145 Hs.77256 enhancer of zeste (Drosophila) homolog 2 7.6 

102590 U62136 Hs.79300 Homo sapiens enterocyte differentiation a 7 

102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-blndJ 4 

102592 U62389 Hs. 11223 Human putative cytosolic NADP-depende 5 
60 102617 U65928 Hs. 198767 Jun activation domain binding protein 6.1 

102618 U65932 Hs.81071 extracellular matrix protein 1 23.2 

102638 U67319 Hs.9216 caspase 7; apoptosiswelafed cysteine prot 8.9 

102663 U70322 Hs.168075 karyopherin (importin) beta 2 7,1 

tQ2666 V7Q660 Hs2799i0 ATX1 (antioxidant protein 1; yeast} homo 4.7 

65 102679 U72661 Hs.11342 ntnjurin 1; nerve injury4nduced protein-1 4.7 

102687 U73379 Hs.93002 ubiquitin carrier protein E2-C 7.7 

102704 U76638 Hs.54089 BRCA1 associated RING domain 1 5.6 
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102705 U77180 Hs.50002 small inducible cytokine subfamily A (Cy 

102721 U79241 Hs.118666 Human clone 23759 mRNA; partial cds 

102729 U79254 Hs.181311 asparagtnyWRNA synthetase 

102739 U792S2 Hs.155572 Human done 23801 mRNA sequence 

5 102742 U79293 Hs.159264 Human clone 23948 mRNA sequence 

102761 U82130 Hs.118910 tumorsusceptibflfy gene 101 

102788 U86602 Hs.74407 nucleolar protein p40 

102790 U87269 Hs.154196 E4F transcription factor 1 

102801 U89606 Hs.38041 pyridoxal {pyridoxlne; vitamin B6) kinase 

10 102608 U90426 Hs.179606 nuclear RNA heEcase; DECD variant of O 

102817 U90904 Hs.83724 Human clone 23773 mRNA sequence 

102823 U90914 H&5057 carboxypeptidase D 

102827 U91327 Hs.6456 chaperc^contalriirgTCP1;subunit2(b 

102838 U94592 Hs.80658 Human uncoupling protein homolog (UCP 

15 102841 U95006 Hs.37616 Human D9 spOce variant B mRNA; comp 

102844 U96113 Hs324275 Homo sapiens Nedd-4-fike ubiquitirvprot 

102868 X02419 Hs.77274 plasminogen activator; urokinase 

102907 X06985 Hs.202833 heme oxygenase (decydlng) 1 

102919 X12447 aldolase A; frudose-bisphosphate 

20 102929 X13238 Hs.74649 cytochrome c oxidase subunit Vic 

102973 X16663 Hs.14601 hematopoietic cett-specificLyn substrate 

102983 X17620 Hs.118638 norwnetastatic eels 1; protein (NM23A) 

102985 X17644 Hs.2707 G1 to S phase transifion 1 

103003 X52003 Hs.1406 trefoil factor 1 (breast cancer; estrogen-ind 

25 103018 X53296 Hs.81134 Interteukln 1 receptor antagonist 

103023 X53793 Hs.117950 multifunctional polypeptide similar to SA 

103036 X54925 Hs.83169 matrix metatoprotelnase 1 (Interstitial col 

103060 X57766 Hs. 155324 matrix metalloproteinase 1 1 (stromeJysin 

103073 X59417 Hs.74077 proteasome (prosome; macropain) subunit 

30 103075 X59543 Hs.2934 ribonucleotide reductase M1 polypeptide 

103080 X59798 Hs.82932 cydin D1 (PRAD1; parathyroid adenomat 

103094 X60787 Hs.296281 interteukln enhancer binding factor 1 

103105 X61970 Hs.76913 proteasome (prosome; macropain) subunit 

103121 X63679 Hs.4147 translocating chain-associating membrane 

35 103149 X66363 Hs.171834 PCTAIRE protein kinase 1 

103180 X69433 Hs.5337 Isocltrate dehydrogenase 2 (NADP+); mit 

103182 X69819 Hs.99995 Intercellular adhesion molecule 3 

103188 X70O40 Hs.2942 macrophage stimulating 1 receptor (c-met 

103191 X70218 Hs.2903 protein phosphatase 4 (formerly X); cataly 

40 103193 X70476 Hs.75724 coatomer protein complex; subunit beta 2 

103194 X70649 Hs.78580 DEAD/H (Asp-Glu^Ia-Asp/HIs) box pot 

103195 X70940 Hs.2642 eukaryotlc translation elongation factor 1 

103206 X72755 Hs.77367 monokine Induced by gamma interferon 

103207 X72790 Human endogenous retrovirus mRNA for 
45 103208 X72841 Hs.31314 retinoblastcma-blndlng protein 7 

103216 X74262 Hs.16003 nstinobJastoma-binalng protein 4 

103226 X75042 Hs.44313 v-rel avian retlcufoendotheltosis viral onco 

103230 X75861 Hs.74637 testis enhanced gene transcript 

103262 X78565 Hs.289114 hexabrachlon (tenascin C; cytotactln) 

50 103278 X79882 Hs.80680 lung resistance-related protein 

103297 X81788 Hs.9078 Immature colon carcinoma transcript 1 

103302 X82103 Hs.3059 coatomer Drotein complex; subunit beta 

103316 X83301 Hs.324728 SMA5 

103330 X85373 Hs.77496 small nuclear n^onucteoprotein polypepti 

55 103349 X89059 serine/threonine kinase 9 

103352 X89398 Hs.78853 uracMDNA glycosylate 

103364 X90872 Hs.279929 SULT1C sulfotransferase 

103374 X91788 Hs.84974 chloride channel; nudeofide-sensitive; 1A 

103380 X92396 Hs.24167 synaptcbre vMce 1 

60 103395 X94754 Hs.279946 methionlne4RNA synthetase 

103402 X95404 Hs;180370 cofuln 1 (non-muscle} 

103410 X96506 Hs.295362 DRtessedated protein 1 (negaffve cofact 

103420 X97065 Hs.173497 Sec23 (S. cerevfciae) homolog B 

103421 X97074 Ks. 119591 adaptor-related protein complex 2; slgma 
65 103427 X97303 Hs.250655 H.sapiena mRNA for Ptg-12 protein 

103430 X97544 Hs.20716 transtocase of Inner mitochondrial membr 

103438 X98263 Hs. 152720 M-phase phosphoproteln 6 
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105012 AA1 16036 Hs.9329 chromosome 20 open reading frame 1 10.7 

105019 AA121879 Hs.9280 proleasome (prosome; macropain) subunft 57 

105029 AA126855 Hs.13268 ESTs 4.4 

105033 AA127964 Hs^74329 TP53 target gene 1 6.3 

5 105035 AA128486 Hs.8859 ESTs 6.5 

105039 AA130349 Hs.36475 ESTs 4 

105062 AA134968 Hs.36529 ESTs 4.3 

105076 M142858 Hs.37810 ESTs 6.4 

105087 AA147884 Hs.9812 ESTs 9.2 

10 105091 AA148859 Hs.179909 ESTs; Weakry similar to til! ALU SUBFA 5.7 

105093 AA149051 Hs.32405 ESTs 6.3 

105107 AA152302 Hs.25035 DKFZP566G223 protein 6.2 

105127 AA158132 Hs.301957 ESTs; Weakly similar to contains similar! 5.7 

105132 AA159501 Hs.247280 HBV associated factor 4.2 

1*5 105143 AA165333 Hs.24808 ESTs 47 

105154 M171736 Hs.35947 methyl-CpG binding domain protein 4 9 

105162 AA176690 Hs.4084 KIAA1025 protein 9.1 

105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 19.3 

105209 AA205072 Hs.227743 KIAA0980 protein 7.4 

20 105223 AA211388 Hs.7750 ESTs 5.1 

105252 AA227428 Hs.9728 ESTs; WeaWy similar to WAA051 2 prote 11.1 

105253 AA227448 Hs.5003 KIAA0456 protein 6.4 
105261 AA227871 Hs.6361 MEK partner 1 9.1 
105263 AA227926 Hs.6682 ESTs 6.7 

25 105274 AA228122 Hs.281866 ATPase; H+ transporting; lysosomal (vacu 5.3 

105297 AA233451 Hs.183858 transcriptional toterrnediary factor 1 8.7 

105309 AA233790 Hs.4104 ESTs; Weakly similar to cDMA EST yk38 - 7.4 

105312 AA233854 Hs.23348 S-phase kinase^associated protein 2 (p45) 5.8 

105342 AA235286 Hs.157078 ESTs 4.5 

30 105376 AA236559 Hs.8768 ESTs; Weakly similar to HI! ALU SUBFA 5.8 

105386 AA236950 Hs.8115 ESTs 5.5 

105397 AA242868 Hs.7395 ESTs; Weakly similar to house-keeping p 7.7 

105399 AA243007 Hs.16420 ESTs; Highly similar to SH3 domain-bind 5.6 

105400 AA243052 Hs.65648 RNA binding motif protein 8 5.8 
35 105404 AA243303 Hs.21187 ESTs 9.1 

105409 AA243562 Hs.301855 ESTs 4.4 

105436 AA252172 Hs.237856 ESTs; Moderately sinto to CAMP Indue 5.1 

105483 AA255874 Hs.23458 ESTs 4.9 

105493 AA256268 Hs.10283 ESTs 6 

40 105495 AA256317 Hs.28785 Homo sapiens mRNA; cDNA DKFZp566 5.2 

105496 AA256323 Hs.301997 DKFZP434N126 protein 8.7 

105500 AA256485 Hs.222399 CGI-96 protein 9.5 

105507 AA256678 Hs.226318 ESTs; Moderately similar to CCR4-associ 4.1 

105538 AA258860 Hs.32597 ring finger protein (C3H2C3 type) 6 4.1 

45 105544 AA261954 Hs.24678 ESTs 8 

105546 AA262032 Hs.268281 ESTs; Weakly simffar to 62D9.a [D.melan 8.1 

105549 AA262417 Hs.5415 ESTs 4.6 

105551 AA262477 Hs.25292 ribonuclease HI; large subunit 9.1 

105560 AA262783 Hs.306915 ESTs 4.5 

50 105565 AA278302 Hs.18349 ESTs; Weakly similar to partial COS (C,e 4.2 

105566 AA278323 Hs.17481 Homo sapiens done 24606 mRNA sequen 11.9 

105575 AA278717 Hs.12772 ESTs 5.9 

105584 AA279012 Hs.3454 ESTs; Weakly similar to KiAA0665 prote 4.4 

105596 AA279418 Hs.18490 ESTs 4 

55 105604 AA279787 Hs.15467 ESTs; Moderately stmBaj to putative pho 5.6 

105610 AA279991 Hs.99872 ESTs; Weakly similar to trithorax homota 5.3 

105621 AA280865 Hs.6375 Homo sapiens mRNA; cDNA DKFZp564 4.8 

105627 AA281245 Hs.23317 ESTs 7.5 

105638 AA281599 Hs.247817 Homo sapiens mRNA for for Wstone H2B 5.9 

60 105645 AA282138 Hs.11325 ESTs 6.4 

105650 AA282347 Hs.25635 ESTs; Highly similar to HSPC003 (H.sap 11.3 

105666 AA283930 Hs.34906 ESTs 4.7 

105674 AA284755 Hs.279789 CDW52 antigen (CAMPATH-1 antigen) 8 

105687 AA286809 Hs.28423 ESTs 7.1 

65 105700 AA287643 Hs.35254 ESTs; Weakly similar to hypothetical pro 4.9 

105705 AA290767 Hs.101282 Homo sapiens mRNA; cDNA DKFZp434 8 

105709 AA291268 Hs.26761 OKFZP586L0724 protein 6.8 
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105731 AA292711 Hs.29131 ESTs 6.4 
105753 AA299789 Hs.1 10857 ESTs 7 
105774 AA348014 Hs.234t2 ESTs 7,1 
105784 M350771 Hs.17850 ESTs 13.4 
5 105791 M358036 Hs.14368 SH3-blnding domain glutamic acid-rich p 4.3 

105807 AA393803 Hs.16869 ESTs; Moderately similar to COLLAGEN 5.3 

105808 AA393808 Hs.286131 K1AA0438 gene product 4.1 

105812 AA394126 Hs.20814 ESTs; Highly similar to CGI-27 protein [H 14.6 

105813 AA394140 Hs.18585 ESTs 4.9 
10 105819 AA397920 Hs.28783 Homo sapiens mRNA; cONA DKFZp564 4.9 

105870 AA399623 Hs.101067 ESTs 4.8 
105874 AA400074 Hs.171118 ESTs 4 
105896 AA400999 Hs.7838 Human ring zinc-finger protein (ZNF127- 4.8 
105934 AA404248 Hs.16577 ESTs 5.2 
15 105935 AA404277 Hs.263727 ESTs; Weakly similar to bispbosphate 3*- 4 
105966 AA406105 Hs.5344 adaptor-related protein complex 1; gamma 8.3 
105974 AA406321 Hs.6224 KIAA0895 protein 4.6 
105990 AA410336 Hs.29403 ESTs; Weakly similar to PROBABLE AT 4.5 
105995 AA410510 Hs.5345 ESTs 4,9 

20 106000 AA410972 Hs.20726 ESTs 5.8 

106007 AA411462 Hs.11042 ESTs; Weakly similar to vefi 1 [H.sapien3 6.9 

' 106016 AA411819 Hs.8164 K1AA0898 protein 5 

106034 AA412473 Hs.14928 ESTs 6,6 

106042 AA412700 Hs.169895 ubiquffiivconjugating enzyme E2L 6 4.6 

25 106057 AA417067 Hs.289074 ESTs 4.5 

106065 AA417558 Hs.25206 ESTs 123 

106070 AA417761 Hs.5957 Homo sapiens clone 24416 mRNA sequen 5 

106103 AA421104 Hs.12094 ESTs 15.4 

106126 AA424006 Hs.22972 ESTs; Moderately similar to H5AR [Mm 6.4 

30 106154 AA425304 Hs.6994 ESTs 51 

106157 AA425367 Hs.34892 ESTs 11.1 

106166 AA425872 Hs.19561 NADH dehydrogenase (ubiquinone) 1 alp 19.3 

106204 AA428024 Hs.21479 ESTs 4.7 

106210 AA428239 Hs.10338 ESTs 57 

35 106220 AA428582 Hs.32196 ESTs; Moderately similar to metargidin p 7.7 

106236 AA429951 Hs.21104 ESTs 8 

106240 AA430074 Hs.18552 ESTs; Weakly similar to Ytr218cp [S.cere 4.4 

106263 AA431462 Hs.28329 ESTs 4.9 

106288 AA435536 Hs.24338 ESTs 8.8 

40 106293 AA435591 Hs.301444 signal sequence receptor; gamma (transloc 8.7 

106310 AA436244 Hs.17240 ESTs 4.5 

106317 AA436568 Hs.108124 ESTs 4 

106328 AA436705 Hs.28020 KIAA0766 gene product 4.4 

106341 AA441798 Hs.5243 ESTs; Moderately similar to pll_2 hypoth 23.7 

45 106348 AA442253 Hs.10702 ESTs 4.7 

106350 AA442763 Hs.194698 cydinB2 6.1 

106371 AA443923 Hs.170310 ESTs 6.8 

106389 AA446949 Hs.6236 ESTs 4.7 

106394 AA447223 Hs.25320 Homo sapiens clone 25142 mRNA sequen 4.4 

50 106426 AA448282 Hs.16206 ESTs; Weakly similar to F55C12.5 [C.ele 4.5 

106459 AA449741 Hs.4029 gltoma-ampOfied sequence-41 4.8 

106462 AA449912 Hs.30532 ESTs; Highly similar to CGl-77 protein [H 5.2 

106468 AA450047 Hs.14770 ESTs 6.8 

106479 AA450351 Hs.75251 ESTs 12.4 

55 106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 4.5 

106503 AA452411 Hs.29679 ESTs; Highly simitar to mediator fH^aple 5.1 

106507 AA452584 Hs.267819 protein phosphatase 1; regulatory (inhibito 4.9 

106533 AA453786 Hs.145998 ESTs 8.3 

106568 AA455970 Hs.28285 patched related proteki translocated in ren 7.6 

60 106586 AA456598 Hs.57787 ESTs 62 

106589 AA456646 Hs.28561 ESTs 4.8 

106606 AA457730 Hs.283437 Homo sapiens clone 23851 mRNA sequen 4.4 

106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapie 7 

106614 AA458934 Hs.256150 ESTs 45 

65 106628 AA459657 Hs.12311 Homo sapiens done 23570 mRNA sequen 6.5 

106637 AA459961 Hs.250824 ESTs 5 5 

106644 AA460239 Hs.12680 ESTs 44 
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106664 AA460969 Hs.7510 mitogen-activated protein kinase kinase ki 6.4 

106698 AA463745 Hs.29403 ESTs; Weakly similar to PROBABLE AT 5.3 

106719 AA465171 Hs.236844 ESTs 5.6 

106726 AA465339 Hs.3886 ESTs 10.1 

5 106747 AA476473 Hs.171957 triple functional domain (PTPRF Interacti 10.4 

106759 AA477263 Hs.25584 ESTs 4.2 

106765 AA477717 Hs^06117 Interfeukm 13 receptor; alpha 1 6.9 

106784 AA478558 Hs.227913 API5-Bke1 5.1 

106831 AM82014 Hs.29463 centrin; EF-hand protein; 3 (CDC31 yeast 5.1 

10 106836 AA482112 Hs.238707 ESTs 4.8 

106840 AA482548 Hs.5534 ESTs 103 

106856 AA486183 Hs.285123 ESTs; Weakly similar to similar to oxyste 6.2 

106865 AA487228 Hs.19479 ESTs 4.5 

106878 AA488872 Hs.12314 Homo sapiens mRNA; cDNA DKFZp586 7.9 

15 106888 AA489101 Hs.24734 oxysteroJ binding protein 6.4 

106895 AA489665 Hs.25245 ESTs 4.6 

106909 AA490323 Hs.250747 SUMO-1 activating enzyme subunit 1 4.2 

106919 AA490885 Hs.21766 ESTs 1Z3 

106920 AA490899 Hs.296323 ESTs 6.2 
20 106941 AA496204 Hs.237971 ESTs 4 

106942 AA496347 Hs.31314 fetinobtastoma-blndlng protein 7 4.8 

106948 AA496788 Hs.21077 WAA0532 protein 4 

106968 AA504631 Hs.26813 ESTs; Weakly similar to hypothetical 43.2 4.4 

106973 AA505141 Ks.11923 Human ONA sequence from clone 167A1 5.4 

25 106980 AA521121 Hs.8858 bromodomain adjacent to zinc finger dom 4.1 

106981 AA521157 Hs.74101 ESTs 5.7 

106998 AA598461 Hs.195464 insuGn-like growth factor binding protein 18.7 

107008 AA598710 Hs.23740 ESTs 6.2 

107028 AA599214 Hs.24143 ESTs 4.1 

30 107032 AA599472 Hs.247309 succinate-CoA Ogase; GOP-formlng; beta 5.3 

107052 AA600134 Hs.12482 gryceronephosphata Oacyttransferasa 4.8 

107053 AA600147 Hs.5741 ESTs; Weakly similar to NADH-cytochro 5.8 
107056 AA600310 Hs. 18720 programmed ceD death 8 (apoptosls-induc 4.9 
107080 AA609210 Hs.19221 ESTs 8.4 

35 107102 M609723 Hs.30652 ESTs 8 

107109 AA609943 Hs.32793 ESTs 9.5 

107129 M620553 Hs.4756 flap structure^pecific endonuctease 1 4.9 

107132 AA620598 Hs.9052 ESTs 5.3 

107136 AA620795 Hs.8207 ESTs 4 

40 107140 AA620889 Hs.170088 ESTs 6.7 

107151 AA621169 Hs.8687 ESTs 19 

107159 AA621340 Hs.10600 ESTs; Weakly similar to ORF YKR081c [ 8.1 

107174 AA621714 Hs^5338 ESTs 8.5 

107217 051095 Hs.35861 OKFZP586E1621 protein 7.2 

45 107252 059971 Hs.25925 ESTs 7.9 

107295 T34527 Hs.80120 UDP^^tyl^ha^alactosairtepo 5.6 

107299 T40327 Hs.30661 lung resistance-related protein 8.4 

107324 T61665 Hs.276422 OKFZP586G1 122 protein 7.5 

107372 U85625 Hs.8297 ribonuctease 6 precursor 4.7 

50 107373 U85773 Hs.154695 phosphornannomutase 2 4.8 

107481 W58247 Hs.279766 Homo sapiens Wnesin superfemity motor 6.3 

107531 Y13936 Hs.17883 protein phosphatase 1G (formerly 2C); ma 8.3 

107859 AA024835 Hs.47584 potassium voltage-gated channel; delayed 7.3 

107890 MO26O30 Hs.61311 ESTs; Weakly similar to CALPAIN 2; LA 7.3 

55 107908 AA026894 Hs.42826 ESTs 4.9 

108039 AA041341 Hs.46670 ESTs 5.4 

108040 AA041551 Hs.159971 ESTs 8.4 
108102 AA046424 Hs.49433 ESTs; Weakly similar to HYPOTHETICA 6.6 
108217 AA058686 Hs.62588 ESTs 7.7 

60 108255 AA063157 Hs.172608 ESTs 4 

108358 AA071514 Hs.1634 ESTs 4 

108609 AA10O694 Hs.69499 Human ONA sequence from SAC 15E1 0 5.5 

108647 AA1 12396 Hs.44276 ESTs; Moderately similar to HOMEOBO 14.3 

108676 AA1 15562 Hs.274417 Homo sapiens mRNA; cDNA OKFZp564 5.2 

65 108687 AA120785 Hs£4347 ESTs 5.6 

108695 AA121315 Hs.70823 WAA1077 protein 10.5 

108733 AA126422 zn84f1 js\ Stratagene lung carcinoma 9372 4.4 
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108774 AA128125 Hs.71040 


ESTs; Moderately similar to CELL GROW 


4.6 


108828 AA131584 Hs.273344 


DKFZP564O0463 protein 


5.5 


108872 AA134063 Hs.1 11680 


ESTs 


7,2 


108884 AA134958 Hs.293591 


ESTs 


11.3 


108893 AA135894 Hs.194691 


retinolc add induced 3 


8.9 


109008 AA156360 Hs.87128 


ESTs 


14J 


109010 AA156460 Hs.44229 


dual specificity phosphatase 12 


4.9 


109011 AA156542 HsJ2127 


ESTs 


4.6 


109042 AA159525 Hs.71779 


Homo sapiens ONA from chromosome 19 


7.2 


109086 AA166695 Hs.270737 


tumor necrosis factor (Ggand) superfamiiy 


4 


109090 AA167006 


Hs.70499 


ESTs 


5.9 


109101 AA167708 


Hs.52184 


ESTs 


4.2 


109112 AA169379 


HJL257924 


ESTs 


4 


109160 AA179387 


Hs.301997 


DKFZP434N126 protein 


4 


109166 AA179845 Hs.73625 


RAB6 Interacting; kinesMke (rabklnesin 


13.6 


109178 AA181600 Hs.283707 


ESTs 


11.8 


109179 AA181902 Hs.192789 


ESTs; Weakly similar to III! ALU SUBFA 


5.4 


109261 AA195255 


Hs.61779 


ESTs 


67 


109270 AA195515 Hs.3585 


ESTs; Weakly similar to alternatively spli 


4.9 


109277 AA196332 Hs.86043 


ESTs 


5.4 


109313 AA206800 Hs.86276 


EST s; Moderately similar to zinc finger p 


5.5 


109415 AA227219 Hs.110826 


trinucleotide repeat containing 9 


20.1 


109454 AA2322S5 


Hs 295232 


ESTs 


4,7 


109467 AA232904 


Hs 63187 


ESTs 


6.8 


109481 AA233342 


Hs.289069 


ESTs; Weakly similar to WO40 protein C 


10.6 


109508 AA2338Q? 


He W09 


ESTs; Weakly similar to HI! ALU SUBFA 


a 


109514 AA234087 


Hs.262346 


ESTs; Weakly similar to ORF2: function 


8.2 


109572 F02027 


Hs.171937 


ESTs 


4.8 


109632 F04165 


Hs.235873 


ESTs; Weakly similar to K11C42 [Celeg 


5.2 


109644. F04477 


Hs.291531 


ESTs; Moderately simbrto GLYCERAL 


6.6 


109703 F09684 


Hs.24792 


ESTs; Weakly similar to ORF YOR283W 


7.1 


109726 F10009 


Hs.9196 


ESTs 


5 


109747 F10161 


Hs.22969 


ESTs 


4.7 


109799 F10770 


Hs.180378 


Homo sapiens done 669 unknown mRNA 


4.5 


109814 F10979 


Hs.153106 


Homo sapiens done 23728 mRNA sequen 


8.7 


110189 H20543 


Hs.6273 


DKF2P586B1621 protok 


16.6 


110240 H25577 


Hs.176588 


ESTs; Weakly similar to CYTOCHROME 


6.2 


110280 H29285 


Hs.32468 


ESTs 


4.5 


110520 H56965 


Hs.4082 


yr09f06.s1 Soares fetal fiver spleen 1Na 


5.7 


110561 H59617 


Hs.5199 


ESTs; Weakly similar to UBIQUITIN-CO 


19.5 


110707 H95079 


Hs.15617 


ESTs; Weakly similar to III! ALU SUBFA 


6.2 


110734 H98714 


Hs.24131 


ESTs 


202 


110770 N22262 


Hs.131705 


ESTs 


5.8 


110780 N23174 


Hs.22891 


solute cantor family 7 (cationlc amino ad 


8.2 


110787 N24716 


Hs.12244 


ESTs; Weakly similar to C44B9.1 [Celeg 


6.7 


110794 N25262 


Hs.27931 


ESTs 


5.9 


110799 N26101 


Hs.323401 


Human ring zinc-finger protein (ZNF127- 


4 


110818 N29454 


Hs.27552 


ESTs; Weakly similar to putative p150 [H 


4.3 


110839 N30856 


Hs.30246 


solute carrier family 19 (thiamine transpo 


12.8 


110844 N31952 


Hs.167531 


Homo sapiens mRNA full length Insert cD 


10.1 


110854 N32919 


Hs.27931 


ESTs 


4.7 


110856 N33063 




ESTs; Weakly sNIarto S164 [H.sapiens 


4.2 


110860 N33438 


Hs.170065 


ESTs 


125 


110897 N39148 


Hs.6880 


DKFZP4340156 protein 


4 


110915 N46252 


Hs,29724 


ESTs 


232 


110935 N48787 


Hs.305979 


protease inhibitor 1 (antt-efastase); alpha- 


4.8 


110970 N51374 


Hs.96870 


Homo sapiens mRNA full length Insert cD 


5.4 


111006 N53375 


Hs.166146 


Homer; neuronal immediate early gene; 3 


4.7 


111008 N53388 


Hs.7222 


ESTs 


13.3 


111018 N54067 


Hs.3628 


mltogen-activated protein kinase kinase ki 


5.7 


111084 N59543 


Hs.15456 


PDZ domain containing 1 


8.3 


111100 N62522 


Hs.20450 


ESTs 


14.3 


111125 N63823 


Hs.269115 


ESTs 


7.9 


111132 N64378 


Hs.83293 


ESTs; Highly similar to unknown function 


4.4 


111139 N64683 


Hs.290943 


ESTs 


6 


111164 N66857 


Hs.14808 


ESTs; Weakly similar to till ALU CLASS 


4.1 


111172 N67102 


Hs.21851 


Homo sapiens mRNA; cONA DKFZp586 


5.5 
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111178 N67227 Hs.24633 ESTs 5.7 

111179 N87239 Hs.10760 ESTs 37 
111181 N67278 Hs.UM2 ESTs; Weakly similar to hypothetical pro 5.6 
111184 N67437 Hs.243901 Homo sapiens mRNA; cONA DKFZp564 8.7 

5 111221 N68869 Hs.15119 ESTs 7.3 

111223 N68921 Hs.297939 ESTs; Weakly similar to neogenin (Rsap 9 

111229 N69113 Hs.110855 ESTs 8.9 

111241 N69514 H&288880 ESTs; Weakly similar to CGI-82 protein [ 6.9 

111268 N70481 Hs.26118 Homo sapiens done 24766 mRNA sequen 4.5 

10 111295 N73275 Hs^1Z75 ESTs; Weakly simfiar to ublquitin-conjug 5.6 

111299 N73808 Hs.24936 ESTs 8.5 

111336 N79565 Hs.29394 ESTs 6.7 

111357 N91023 Hs.87128 ESTs 15 

111370 N92915 Hs.94631 brefeldin A-biWbfted guanine nucleotide* . 5.2 

15 111806 R33468 Hs.279008 ESTs 10 

111825 R35885 Hs.286148 stromal antigen 1 4.5 

111836 R36228 Hs.25119 ' ESTs 7.2 

111890 R38678 Hs.12365 ESTs 17.3 

111923 R39995 Hs.25925 Homo sapiens clone 23860 mRNA sequen 7.3 

20 111942 R40576 Hs.21590 ESTs 9.2 

111987 R42036 Hs.6763 K1AA0942 protein 10.6 

112101 R44793 Hs.296341 adenyty! cyclase-associated protein 2 5.3 

112134 R46025 Hs.7413 ESTs 17.4 

112197 R49482 Hs.5637 ESTs 4.4 

25 112244 R51309 Hs.70823 WAA1077 protein 11 

112253 R51618 Homo sapiens mRNA; cDNA DKFZp566 9.3 

112305 R54822 Hs.26244 ESTs 4.4 

112449 R63802 Hs.124186 ring finger protein 2 6.3 

112483 R66534 Hs.285885 ESTs 4.9 

30 112519 R68631 Hs.11861 ESTs 14.3 

112610 R79392 Hs.23643 ESTs 5.2 

112693 R88741 Hs.91065 ESTs; Moderately similar to proliferation 4.6 

112751 R93507 Hs.8207 ESTs 5.6 

112801 R97486 Hs.157160 protein kinase; DMA-activated; catalytic p 8.7 

35 112869 T03313 Hs.4747 dyskeratosis congenita 1; dyskerin 5.9 

112871 T03352 Hs.12285 ESTs 5.8 

112908 T10065 Hs.3530 US-associated serin*argtoine protein 4.1 

112966 T17119 Hs.102548 glucocorticoid receptor DNA binding fact 5.7 

112971 T17185 Hs.83883 ESTs 6.4 

40 112995 T23528 Hs.7155 ESTs; WeaWysJmSar to TYK) protein [M 9.1 

113047 T25867 Hs.7549 ESTs 5.4 

113075 T34660 Hs.6986 ESTs; WeaWy simHar to 1111 ALU SUBFA 5.7 

113117 T47819 Hs.159153 ESTs 5.8 

113206 T58044 Hs.241471 ESTs; Moderately similar to 111! ALU SU 6.4 

45 113248 T63857 yc16e1.s1 Stratagene lung (#93721) Homo 4.6 

113260 T64896 Hs.287420 ESTs 6.9 

113277 T65797 Hs.11774 protein (peptidyl-proiyi cis/trans feomeras 5.6 

113278 T65802 Hs.11135 yc11M 0^1 Stratagene lung (#937210) Ho 4.5 
113440 T86121 Hs.191445 ESTs 6.4 

50 113523 T90037 Hs.95549 ESTs 6.4 

113604 T92735 Hs.296083 ESTs 8.7 

113702 T97307 ESTs; Moderately similar to HI! ALU SU 9.5 

113783 W19222 Hs.7041 ESTs; Weakly similar to iiil ALU SUBFA 5.2 

113794 W37382 Hs.1109O ESTs 11.9 

55 113808 W44735 Hs.9286 ESTs 16.7 

113811 W44928 Hs.6994 ESTs 4 

113822 W47350 Hs. 17466 retinoic acid receptor respond er (tazaroten 4.8 

1 13823 W47388 Hs.55099 rab6 GTPase activating protein (GAP and 4 
113836 W56792 Hs.12040 ESTs; WeaWy similar to KIAA0881 prate 4.1 

60 113857 W65477 Hs.5297 Homo sapiens mRNA; cONA DKF2p564 4.3 

113886 W72471 Hs.23920 ESTs 4.6 

113895 W73738 Hs.12921 ESTs 7.1 

113923 W80763 Hs.3849 ESTs; Weakly similar to FK506-btnding p 6.8 

113931 W81205 Hs.3496 ESTs 6.1 

65 113950 W85765 Hs.30504 Homo sapiens mRNA; cONA OKF2p434 14 

113970 W86748 Hs.8109 ESTs . 15 

114051 W94942 Hs.177534 dual specificity phosphatase 10 5.4 
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114U57 W96222 Hs.34192 


ESTs 


4.8 


114086 Z38266 Hs.288649 


Homo sapiens PAC clone OJ0777O23 fro 


5.1 


mlRJO £0004/ hS. 110000 


tb rs, weaKiy sunuar to similar to 5. cere 


6.2 


IVIIUSJ £oo4JD nS.io4lUo 


ribosomal protein L21 


4.6 


114124 Z38595 Hs.125019 


ESTs; Highly similar to K1AA0886 prate 


22 


114i3o Z38763 Hs. 15740 


amyloid beta (A4) precursor rxotein-bindi 


8.8 


114149 Z36814 ns.27196 


ESTs 


4 


114 ibZ Zoo9U9 Hs.22265 


ESTs 


7.2 


114777 £j9Uo2 Hs.23740 


ESTs 


5.3 


1141% Z39211 Hs.150926 


fucose-1 -phosphate guanylyltransferase 


4.4 


114208 Z39301 Hs.7859 


ESTs 


5.1 


114250 239897 Hs. 13297 


ESTs 


72 


•H/IOC4 7QO0OQ U» HfMQ 

114251 Z39o9o ns.21940 


toTS 


14.7 


114292 Z40715 Hs,184641 


delta-6 fatty acid desaturase 


19.4 


114297 Z40758 Hs.173091 


DKFZP434K151 protein 


8.9 


114334 Z41342 Hs.22941 


ESTs 


13.7 


114460 AAQ24604 Hs.26102 


ESTs 


10.1 


114471 AA028074 Hs. 104613 


ESTs 


5.7 


114480 AA032243 Hs.1 51678 


UDP^acetyl-aIpha-r>gaIactosarnlne:po 


7.3 


114518 AA046407 Hs. 106469 


suppressor of varl (S.cerevisiae) 3-Jike 1 


4.3 


114542 AA055768 Hs.293380 


ESTs 


11.7 


114549 AA056484 Hs.292833 


ESTs 


7.3 


4 4 4 nr*S A A 4f\ 4 4 Af* Li— 4fV~>4AfS 

114652 M10 1416 Hs. 107149 


ESTs; Weakly similar to PT&ASSOCIAT 


6.1 


114673 AA1 13303 Hs.95583 


transmembrane 4 superfamily member (te 


4.3 


114698 AA126951 Hs.1 10857 


ESTs; Highly similar to putative DNA-dir 


7.1 


114767 AA148885 Hs.154443 


mmichromosome maintenance deficient (S 


5.3 


114799 AA1 59323 Hs.109929 


ESTs 


42 


114804 AA1 60363 Hs.269956 


ESTs 


AS 


114811 AA161161 Hs, 95907 


multiple inositol polyphosphate phosphata 


7,1 


114821 AA165313 Hs.55468 


ESTs 


4.4 


A A AO£f\ A A O^f A4f Li— 

114852 AA235035 Hs.38260 


ESTs; Moderately similar to ubiquitin spe 


5 


AAAf\47\A A A rt^^mA II JAOiO^ 

114901 AA236276 Hs. 196437 


ESTs; Weakly similar to R26660J; parti 


16.9 


114902 AA236359 Hs. 39504 


ESTs 


5.1 


114940 AA243012 Hs.75928 


ESTs 


8.5 


114965 AA250737 Hs.72472 


ESTs 


35.1 


115047 AA252627 Hs.82916 


homeo box B5 


5.7 


115054 AA252863 Hs.87729 


ESTs 


6.2 


4.4Cr\C4 AA0C0O47 U~ 

115061 AA253217 Hs.41271 


ESTs 


13 


115082 AA255557 Hs.198269 


NADH dehydrogenase (ubiquinone) 1 aip 


28.2 


115116 AA256486 Hs.62275 


ESTs 


8.8 


115140 AA258030 Hs.279938 


ESTs; Weakly similar to supported by GE 


4.1 


4 A A AAAi^A i 1 AAJA4A 

115205 AA262470 Hs.284216 


ESTs 


8.3 


115206 AA262491 Hs. 186572 


ESTs 


5.1 


115239 AA278650 Hs.73291 


ESTs; Weakly similar to similar to the bet 


4.6 


115242 AA278755 Hs.283732 


ESTs 


8.3 


115249 AA278961 Hs.71124 


ESTs 


10.1 


115259 AA279071 Hs.13453 


splicing factor 3b; subunit 1; 155kO 


9.5 


115285 AA279799 Hs,293736 


ESTs 


5.8 


115291 AA279943 Hs.122579 


ESTs 


5.1 


115357 AA281793 Hs.72988 


ESTs 


5 


115377 AA282247 Hs, 193063 


ESTs 


6.1 


115400 AA283198 Hs.89113 


ESTs 


4.9 


115439 AA284561 Hs.193090 


ESTs 


5.8 


115471 AA287138 Hs.59346 


ESTs; Weakly similar to ASPARTYL-TR 


11.7 


115506 AA292537 Hs.45207 


Human DNA sequence from done 620E1 


6.8 


115522 AA331393 Hs.47378 


ESTs 


5,8 


115572 AA398392 Hs.59594 


ESTs; Weakly similar to F33G12.3 gene p 


9.7 


115587 AA399264 Hs.283037 


ESTs; Highly similar to HSPC039 protein 


8.7 


115600 AA400247 Hs.42173 


ESTs 


4 


115612 AA400948 Hs.71243 


ESTs; WeaWy similar to zinc finger prote 


8.4 


115646 AA404352 Hs.305971 


ESTs 


5.3 


115652 AA405098 Hs.38178 


ESTs 


16.1 


115657 AA405620 Hs.55158 


ESTs; Weakly similar to weak similarity t 


4.7 


115658 AA405625 Hs.183056 


Human DNA sequence from clone 34B21 


5.1 


115675 AA406546 Hs.82065 


Homo sapiens mRNA; cONA OKFZp564 


20.5 


115721 AA417102 Hs.90960 


ESTs 


4.8 


115763 AA421560 


ESTs 


7 
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115764 


AA421562 


Hs.91011 


anterior gradient 2 {Xenepus laevis) homo 


41.6 


115835 


AA428576 


Hs.41371 


ESTs 


4.2 


115844 


AA430124 


Hs.7773 


ESTs 


115 


115875 


AA433943 


l_i_ A1(\AC 

HS.43946 


coi s; Weakly similar to Weak slmuarny 


33.5 


115888 


A A MO coon 

AA435839 


n3.7o59i 


KIAAQoo/ protein 


7.2 


115922 


AA441911 


Hs.71B69 


ESTs; Weakly similar to WAA0926 prate 


5.1 


115941 


AA443602 


Hs.46679 


ESTs 


4.8 


115947 


A ft A A OTftO 

AA443793 


Hs.94761 


ESTs 


8.3 


115948 


AA443798 


Hs.43445 


poJy(A^specific ribonuctease (deadenyiat 


13.5 


115951 


A A A A 1Q«Q 

AA443910 


ns.30i04o 


cofifin 1 (rrarwnuscte) 


75 


115967 


AA44ooo7 


Hs.4291 1 


ESTs 


Q Q 

0.0 


115984 


M447687 


» t AAA AA 

Hs.91109 


ESTs 


13.1 


116009 


A A J ifti An 

M449448 


11 J JAM A 

Hs.44238 


ESTs 


5.5 


116024 


AA451748 


Hs.83883 


Human DNA sequence from clone 718J7 


7.5 


116028 


AA452112 


Hs.42644 


thloredoxh-fike 


12.7 


116050 


M453656 


Hs.88417 


ESTs 


72 


116097 


AA456099 


Hs.1 76376 


ESTs 


11.8 


116108 


AA4 57566 


HsJ>8777 


ESTs 


AJ5 


116121 


AA459254 


Hs.48855 


ESTs 


45 


116127 


M459703 


Hs.279884 


v-myc avian myetocytornatosis viral onco 


4.3 


116129 


AA459956 


Hs.49163 


ESTs; Highly similar to putative rtbonude 


7.6 


116142 


M460649 


Hs,39457 


ESTs 


4.8 


116204 


AA465701 


Hs. 108646 


ESTs 


6.8 


116221 


M478397 


Hs.50180 


ESTs 


4.9 


116222 


AA478415 


Hs.89986 


ESTs 


4 


116238 


AA479362 


Hs.47144 


DKFZP586N0819 protein 


4.6 


116246 


AA479961 


Hs.250546 


ESTs; Highly similar to ubtquitin-conjuga 


4 


116249 


AA480886 


Hs.86693 


ESTs 


185 


116250 


A A I OftftTf 

AA480975 


Hs.44829 


ESTs 


10.8 


116254 


A A JA4 4 Jfl 

AA481146 


t_J— J 4/\<ir> 

Hs.41086 


ESTs; Weakly similar to OXY STEROl-B 


9.1 


116256 


A a A n 4 /\r-o 

AA481256 


H3.88201 


ESTs: Weakly similar to lysophosphoCpa 


8.4 


116264 


AA482594 


Hs.272239 


Homo sapiens mRNA; cONA DKFZp586 


72 


116265 


A A 4 0OCrtC 

AA482595 


II- CC4 OA 

Hs.55189 


ESTs; Weakly similar to F25B5.3 [ueleg 


44 4 
11.1 


116282 


AAA A^Pf*i\ 

AA486550 


Hs.204501 


ESTs; Weakly similar to Wiskott-Aldrich 


6.2 


116298 


AA489U4D 


HS.94109 


ESTs 


4.9 


116300 


AA489194 


Hs.159471 


ESTs; Weakly similar to snRNP protein B ' 


4.6 


116327 


A ft A/lAftP/t 

AA490959 


HS28005 


Homo sapiens mRNA; cONA DKFZp564 


5.8 


116334 


AA491457 


Hs.48948 


ESTs 


4.3 


116337 


AAA A£ 4 *VT 

M496127 


Hs.44070 


ESTs 


O A 

8.4 


116351 


A AEAA4 4D 
AA0Q4ll0 


Hs.o25Q1 


Homo sapiens mRNA; cuNA DKF^p434 


O.J 


116357 


A a cnA one 

AA504806 


HS.90797 


Homo sapiens clone 23620 mRNA sequen 


52 


116415 


AA609204 


Hs.27973 


K1AA0874 protein 


6.6 


1 low 




Us HGfMOQ 

nS.iyiMoo 


cois; weakly similar © KcKA tin, i yp 


4.0 


116470 


C13992 


Hs .83484 


ESTs 


4.5 


116480 


C140oo 




gfyceraidehyde-3-phosphate dehydrogena 


5.6 


116578 


D51272 


Hs.75337 


nucleolar phosphoprotein p1 30 


4.1 


116579 


051276 


Hs.81915 


leukemia-associated phospho protein p18 


5.8 


116626 


F02028 


Hs.81907 


ESTs 


4.9 


116647 


F03069 


Hs.15395 


ESTs; Weakly similar to ARGINYL-TRN 


6.1 


116674 


F04816 


Hs.92127 


ESTs 


10.6 


116680 


F08813 


Hs.273829 




4.2 


116700 


F09983 


Hs.317589 


ESTs 


13 


116724 


F13665 


Hs.65641 


ESTs 


85 


116726 


F13681 


Hs.53913 


ESTs 


5.6 


116732 


F13779 


Ks. 165909 


ESTs 


11.6 


116734 


F13789 


Hs.93796 


DKFZP586D2223 protein 


5.4 


116760 


H11054 


Hs.155342 


protein kinase C; delta 


4.3 


116780 


H22566 


Hs.30098 


ESTs 


5.7 


116786 


H25836 


Hs.301527 


tumor necrosis factor (figand) superfamily 


8.8 


116787 


H28581 


Hs.15641 


ESTs 


8.6 


116790 


H29532 


Hs.101174 


rnicrotuDUIe-assodated protein tau 


22.2 


116803 


H47357 


Hs.109701 


ESTs; Moderately similar to weak similar 


6.7 


116877 


H68116 


Hs.168732 


ESTs 


&5 


116921 


H72948 


Hs.821 


biglycan 


20.7 


117216 


N20083 


Hs.42792 


ESTs 


4.4 


117232 


N20579 


Hs.61153 


ESTs 


7.4 


117284 


N22162 


Hs.183779 


ESTs; Weakly similar to cONA EST yk33 


4.1 
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117344 N24048 Hs.210706 ESTs 7.4 

117367 N24954 Hs.42502 ESTs 10.5 

117392 N26175 Hs.93405 ESTs 5.8 

117394 N26257 Hs.39871 KIAA0727 protein 8.4 

5 117412 N26722 Hs.42645 ESTs 181 

117498 N31726 Hs.44268 ESTs; Highly similar to myelin gene expr 5.8 

117557 N33920 Hs.44532 dhibiquiUn 12.3 

117634 N36421 Hs.13323 ESTs; Weakly similar to SODIUM- AND 4.4 

117639 N36923 Hs.44833 ESTs 6 

10 117754 N47469 Hs.59757 ESTs 7.6 

117852 N49408 Hs.136102 WAA0853 protein 5.9 

117879 N50050 Hs.303025 ESTs; Weakly similar to keratin; 67K typ 7.9 

117924 N51056 Hs.38891 ESTs 7.9 

117950 N51394 Hs.75478 KIAA0956 protein 5 

15 117992 N52000 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586 7 

118138 N57773 Hs.93560 ESTs; Weakly similar to tig pinorvegicu 4.8 

118215 N62195 Hs.77910 3^ydroxy^-rnetrrylglutaiyKk)enzymeA 13.4 

118229 N62339 Hs.166254 heat shock 90kD protein 1; alpha 5.4 

118265 N62827 Hs.48645 EST 42 

20 118336 N63604 Hs.47166 ESTs 7.2 

118363 N64168 Hs.48938 ESTs 6 

118429 N66158 Hs.74649 ESTs 4.1 

118470 N66769 Hs.291033 ESTs 5.4 

118472 N66818 Hs.42179 ESTs 10.8 

25 118475 N66845 ESTs; Weakly similar to Jill ALU CLASS 4.5 

118493 N67149 Hs.50115 ESTs 5.3 

118528 N67889 Hs.49397 ESTs 10,4 

118542 N68010 Hs.49427 ESTs 7.9 

118600 N69222 ESTs 9.2 

30 118695 N71781 Hs.50081 Homo sapiens mRNA foil length Insert cD 9.8 

118698 N72113 H&50187 ESTs " 4.3 

118901 N90719 Hs.94445 ESTs 8.1 

118952 N92966 ESTs; Highly similar© CGI-90 protein [H 12.5 

118976 N93629 Hs.93391 ESTs 5 

35 118986 N94362 Hs.125830 ESTs 7.3 

118989 N94439 Hs.45105 ESTs 8.2 

119027 N99256 Hs.114611 ESTs 5 

119042 R05316 Hs.5472 ESTs 4 

119075 R36451 Hs.287820 fibronectinl 6 

40 119260 T15916 Hs.102950 ESTs; Highly similar to coat protein gamm 4.1 

119271 T16387 Hs.65328 ESTs 12.1 

119298 T23820 Hs.155478 cyclinTC 5.6 

119302 T25725 ESTs 14.3 

119341 T62571 Hs. 146388 mlcrotubui^assocJated protein 7 4 

45 119495 W35390 Hs.55533 ESTs 5.3 

119580 W42451 Hs.92260 hlgrwnobflity group profeh 2-like 1 5.6 

119602 W46286 Hs.233694 ESTs; Weakly similar to ZK1058.5 [Cete 6.5 

119620 W47620 Hs.56009 2Vffoligoadenylate synthetase 3 8.1 

119676 W60473 Hs.57787 ESTs 5.5 

50 119717 W69134 Hs.57987 ESTs 46 

119729 W69747 Hs.94806 K1AA1062 protein 4 

119805 m3788 Hs.43213 ESTs 4 

119859 W80702 Hs.58461 ESTs 48 

119867 W80852 Hs.250696 KDa (Lys-Asp-Glu-Leu) endoplasmic re 4.2 

55 119873 W81129 Hs.44865 Homo sapiens mRNA; cDNA DKFZp586 4.8 

119899 W84767 Hs.58698 ESTs 59 

119940 W86779 Hs.272531 DKF2P586B0319 protein 9 

119943 W86835 Hs.14158 copinelll 48 

119970 W87812 Hs.93581 Homo sapiens mRNA; cDNA DKFZp586 4 

60 120131 Z38656 Hs.75887 coatomer protein complex; subunit alpha 42 

120150 Z39549 Hs.153746 ESTs 11 

120206 Z40805 Hs.91668 ESTs 82 

120241 Z41815 Hs.65946 ESTs 156 

120255 AA169752 Hs.5672 ESTs; Weakly similar to Similarity to Yea 42 

65 120314 AA194168 Hs.221040 WAA1038 protein 68 

120325 AA195651 Hs.104106 ESTs 152 

120352 AA211400 Hs.193172 ESTs 6.8 
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120428 AA236822 


Hs.173694 


K1AA1097 protein 


5.6 


120524 AA261S52 


Hs.192905 


ESTs 


5.6 


120528 AA262107 


Hs.104413 


ESTs 


4.5 


120571 AA280738 


Hs.34892 


ESTs 


4.9 


120649 AA287115 


Hs.192843 


ESTs 


4.5 


120655 AA287347 


Hs.238205 


ESTs 


6,7 


120668 AA287833 


Hs.292913 


ESTs 


8.3 


120712 AA292654 


Hs.102506 


eukaryotic translation initiation (actor 2 at 


4.6 


120713 AA292655 


Hs.96557 


ESTs 


10.6 


120724 M293470 


Hs.100747 


ESTs 


5,4 


120873 AA358015 




EST 


7.1 


120885 AA365515 


Hs.301872 


ESTs; Moderately similar to 111! ALU SU 


4.6 


120919 AA381125 


Hs.301444 


ESTs 


8.2 


120948 M397822 


Hs.104650 


ESTs; Highly similar to similar to mago n 


8.6 


120989 AA398116 


Hs.129206 


casein kinase 1; gamma 3 


10.5 


120977 AA398155 


Hs.97600 


ESTs 


10.9 


121103 AA398936 


Hs.97697 


EST 


7.4 


121291 AA401753 


Hs.8166 


rung cancer candidate 


5.3 


121320 AA403008 


Hs.301927 


T-cetl receptor; alpha (V;D;J;C) 


13.5 


121463 AA411745 


Hs.239681 


ESTs; Weakly similar to KIAA0554 prote 


8.9 


121596 AA416740 


Hs.174104 


ESTs 


22.6 


121723 AA419622 


Hs.104800 


ESTs; Weakly similar to Mouse 19.5 mRN 


8 


121748 AA421171 


Hs.234545 


ESTs 


5.6 


122125 AA434411 


Hs.98806 


ESTs 


5.3 


122522 AA449444 


Hs.98969 


ESTs 


4 


122655 AA454756 


H3.97837 


ESTs 


4 


122704 AA456326 


Hs.99445 


ESTs 


6.2 


122782 AA459894 


Hs.99472 


ESTs 


5.3 


122856 AA463740 


Hs.75367 


Src-Oke-adapter 


13.1 


122882 AM65381 


Ha.108812 


ESTs; Weakly similar to B0041.5 [Ceteg 


5.5 


122928 AA476578 


Hs.101840 


ESTs 


6.3 


122974 AA478625 


Hs.194215 


ESTs 


6 


122997 AA479295 


Hs.106290 


Kelch motif containing protein 


1Z5 


123016 AA480103 


Hs.323231 


ESTs; Weakly similar to alternatively spti 


4.4 


123107 M486071 


Hs.104207 


ESTs 


8.3 


123111 AA486273 


Hs.191721 


ESTs 


4.2 


123114 AA486407 


Hs.129928 


ESTs; Moderately similar to K1AA0454 p 


5.2 


123136 AA487449 


Hs.194024 


ESTs 


4.2 


123137 AA487468 


Hs.100686 


ESTs; Weakly simflarto secreted cement 


14.6 


123169 AA488892 




ESTs; Weakly similar to Gag-Pol polypro 


4.5 


123176 AA489020 


Hs.69233 


ESTs 


5.2 


123338 AA504249 


Hs.187585 


ESTs 


4 


123436 AA598714 


Hs.223014 


protease; serine; 15 


7.3 


123442 AA598803 


Hs.1 11496 


ESTs 


5.9 


123449 M598899 


Hs.1 12493 


Homo sapiens mRNA; cDNA DKFZp564 


, 4.1 


123494 AA599786 


Hs.112110 


ESTs 


4 


123503 AA600121 


Hs.293156 


ESTs 


12.8 


123533 AA608751 




ESTs; Weakly similar to llll ALU SUBFA 


7.9 


123619 AA609200 




ESTs 


23.1 


123673 AA609471 


Hs.158549 


ESTs 


6.6 


123729 AA609778 


Hs.278672 


membrane component; chromosome 11; s 


4.7 


123819 AA62GS36 


Hs.1 12264 


ESTs 


4 


123960 AA621785 


Hs.287733 


memytmaJonate-semialdehyde dehydroge 


7.6 


124000 D57317 


Hs.74861 


activated RMA polymerase il transcription 


4.4 


124006 D60302 


Hs.270016 


ESTs 


20.6 


124012 080240 


Hs.241471 


HUM5G1 1 A Human fetal brain (TFujlwa 


6.7 


124021 F02859 


Ha.13974 


ESTs 


4.7 


124049 F10523 


Hs.74519 


primase; polypeptide 2A (58k0) 


4.7 


124059 F13673 


Hs.283713 


ESTs 


7.7 


124243 H66710 


Hs.133525 


ESTs 


5.5 


124308- H93575 


Hs.241507 


Homo sapiens mRNA; cDMA DKFZp564 


11.4 


124314 H94877 . 


Hs.215766 


GTP-binding protein 


13.7 


124315 H94892 


HsJ288757 


v-fal simian leukemia viral oncogene horn 


14 


124350 N21359 


Hs.101282 


Homo sapiens mRNA; cDNA OKFZp434 


8.6 


124352 N21626 


Hs.102406 


ESTs 


12 


124357 N22401 




yw37g07.s1 Morton Fetal Cochlea Homo 


5.2 


124390 N29325 


Hs.7535 


ESTs; Highly similar to C08W-Eke place 


7.9 
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124438 N40188 


Hs.11090 


ESTs 


9.5 


124447 N48000 




Homo sapiens mRNA; cDNA DKFZp586 


4.8 


124457 N50114 


Hs.266175 


ESTs 


6.1 


124539 N63172 


Hs.146409 


cell division cycle 42 (GTP-binding prote 


5.6 


124626 N74604 


Hs.11090 


ESTs 


12.8 


124632 N79515 


Hs.306117 


interteukln 13 receptor; alpha 1 


6.4 


124644 N91279 


Hs.109654 


ESTs; Moderately similar to outer membr 


8.3 


124676 R01037 


Hs.181013 


phosphoglycerate mutase 1 (brain) 


12.3 


124677 R01073 




ESTs; Weakly similar to Hi! ALU CLASS 


5.4 


124724 R12405 


Hs.1 12423 


Homo sapiens mRNA; cDMA DKFZp586 


6.6 


124773 R40923 


Hs.106604 


ESTs 


4.9 


124777 R41933 




ESTs 


72 


124792 R44357 


Hs.48712 


ESTs; Weakly similar to cDNA EST EMB 


8.6 


124857 R63652 


Hs.137190 


ESTs 


4.9 


124911 R88992 


Hs.180612 


ESTs 


4.7 


124955 T10598 


Ha.324841 


ESTs; Weakly similar to Hi! ALU SUBFA 


4.4 


124958 T11134 


Hs.431 


murine leukemia viral (bmM) oncogene h 


12.6 


125038 T78089 


Hs.270134 


ESTs 


4.1 


125092 T92544 


Hs.137548 


CD84 antigen (leukocyte antigen) 


14.8 


125132 W15495 


Hs.129781 


chromosome 21 open reading frame 5 


6.7 


125144 W37999 


Hs.24336 


ESTs 


4.8 


.125154 W38419 




ESTs 


5.3 


125243 W86423 


Hs.105413 


ESTs 


6.6 


125279 W93640 


Hs.4779 


ESTs; Moderately similar to similar to AD 


5,8 


125299 Z39436 


Hs.102720 


ESTs 


12.2 


125303 239821 


Hs.288193 


ESTs 


10.2 


125304 Z39833 


Hs.124940 


GTP-binding proteii 


6.8 


125474 AA151216 Hs.75103 


tyrosine 3-monooxygenase/tryptaphan 5-m 


8 


125509 AA044232 Hs.288967 


ESTs 


5.4 


125580 AA126504 Hs.267812 


sorting nexin4 


4.1 


125582 AA507383 Hs.74649 


cytochrome c oxidase subunit Vic 


11.5 


.125670 AJ432621 


Hs.82685 


CD47 antigen (Rh-related antigen; integri 


4 


125698 AA748483 Hs.191356 


general transcription factor IIH; pofypepti 


9.4 


125745 AI283493 


Hs.75722 


ribophorin II 


6.2 


125852 H09290 


Hs.76550 


Homo sapiens mRNA; cDNA DKF2p564 


25.9 


125972 AA434562 Hs.35406 


ESTs 


4.1 


126160 N90960 


Hs.265398 


ESTs; Weakly similar to transformatioiM 


16.4 


126257 N99638 


Hs.124084 


tumor necrosis factor receptor superfamliy 


9.5 


126337 AI066486 


Hs.40500 


similar to S, cerevislae RER1 


5.6 


126405 U46278 


Hs.122489 


ESTs 


7.5 


126537 W40262 


Hs.146310 


ESTs; Weakly similar to putative p150 (H 


4.1 


126590 W78968 


Hs.181307 


H3histone; family 3A 


4.5 


126712 AA205862 Hs.7942 


ESTs 


5.2 


126721 T72569 


Hs.125359 


Thy-1 cell surface antigen 


4.4 


126764 AI334393 


Hs.102178 


ESTs 


4.6 


126804 AI203334 


Hs.160628 


ESTs 


11.7 


126819 AA305536 


Hs.279607 


ESTs 


4 


126877 AI052047 


Hs.26102 


ESTs 


7 


126991 R31652 


Hs.821 


biglycan 


5.6 


127479 AA513722 Hs.179729 


collagen; type X; alpha 1 (Schmld metaph 


m 


127514 AA826926 Hs.204214 


ESTs 


4.5 


127663 W07286 


Hs.10340 


ESTs; Weakly similar to weak similarity t 


5.1 


127677 AA916752 Hs.264190 


ESTs; Highly similar to MEM3 (M.muscu 


17.3 


127814 AA761755 HS.136713 


ESTs; Weakly similarto V4-1 [Usapiens 


4.1 


127997 A1281549 


Hs.311054 


ESTs 


5.5 


128092 AA904617 Hs.166229 


ESTs 


5.8 


128218 H02682 


Hs.292154 


ESTs; Moderately similarto recomblnatio 


5.8 


128466 D59653 


Hs.241471 


EST 


7.4 


128482 U83908 


Hs.296251 


programmed cell death 4 


5.8 


128517 AA280617 Hs.100861 


ESTs; Weakly similarto p60 katanin [H.s 


8.3 


128530 AA504343 Hs.183475 


Homo sapiens done 25061 mRNA sequen 


6.6 


128559 AA226801 Hs.101448 


metastasis associated 1 


5.2 


128574 AA412048 


Hs.38260 


keratin 8 


5.1 


128595 U31875 


Hs.152677 


short-chain alcohol dehydrogenase family 


27.1 


128610 L38608 


Hs.10247 


activated leucocyte ceil adhesion molecule 


13.2 


128629 AA399187 Hs.102708 


DKFZP434A043protem 


6.7 


128649 AA142853 Hs.103106 


Homo sapiens mRNA for G7b protein {G 


4.5 
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128651 AA446990 Hs.103135 ESTs 6.1 

128653 R48943 Hs.10315 solute carrier tamDy 7 {catloalc amino ad 4.4 

128656 AA458542 Hs. 10326 coatomer protein complex; subunit epsiton 143 

128717 T30617 Hs.104222 Homo sapiens mRNA; cONA DKFZp566 245 

5 128727 M64174 Hs.50651 Janus kinase 1 (a protBln tyrosine kinase) 73 

128764 N49308 Hs.104938 ESTs; WeaWy simBar to alpha 1(XVIII) c 9.2 

128781 X85372 Hs. 105465 small nuclear ribonudeoproteln polypepti 5.4 

128793 W93562 Hs.105749 KIAA0553 protein 4.6 

128835 W15528 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586 4 

10 128845 AA455658 Hs. 10649 basement membrane-Induced gene 6.9 

128871 AA400271 Hs. 106778 Homo sapiens mRNA for putative Ca2+4 4.5 

128922 AA252023 Hs.9589 ESTs; WeaMy similar to HRIHFB2157 [H 6.4 

128925 061676 Hs.21851 Homo sapiens mRNA; cONA DKFZp586 6.4 

128938 AA410325 Hs. 107260 ESTs 7 

15 128946 N29353 Hs. 107318 kynurenine 3-monooxygenase (kynurenln 5.2 

128948 AA485655 Hs.223025 proteasome (prosome; macropaln} subunit 13.1 

128955 F10290 Hs.185807 Homo sapiens done 24758 mRNA sequen 5.8 

129005 AA460049 Hs.13323 ESTs; Weakly similar to SODIUM- AND 12.6 

129009 AA131421 Hs.75607 ESTs 9.8 

20 129017 H13108 Hs.107968 ESTs 13.9 

129057 X62466 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 10.7 

129075 AA129465 Hs.83765 ESTs 4.7 . 

129095 L12350 Hs.108623 thrombospondin 2 4.4 

129124 AA234530 Hs.108802 N^thylmaleimide-sensitive factor 20.7 

25 129160 AA131252 Hs.109007 ESTs 5.9 

129164 AA282183 Hs.109045 ESTs 5.8 

129180 R40556 Hs.318401 ESTs; Highly similar to HSPC039 protein 7.6 

129224 X89109 Hs.109606 coronin; acfln-binding protein; 1A 12 

129229 AA211941 Hs.109643 polyadenylate binding proteirHnteracting 7.9 

30 129240 W24360 Hs.237868 InterteuMn 7 receptor 5.3 

129241 AA435665 Hs.109706 ESTs; Moderately similar to HN1 [Minus 8.4 

129243 H88033 Hs.109727 K1AA0733 protein 7.8 

129247 AA151574 Hs.109733 piliiv^tRanscnption factor 6.4 

129259 AA090695 Hs.181385 ESTs 6.2 

35 129270 Z35227 Hs.109918 ras homotog gene family; member H 5.4 

129281 AA026318 Hs.289101 glucose regulated protein; 58kD 4.4 

129300 C20976 Hs.1 10165 ESTs; Highly similar to rtbosomal protein 5.7 

129318 N93155 Ks.285976 calmodulin 1 (phosphorylase kinase; delta 7.7 

129319 AA037467 Hs.30340 ESTs 6 
40 129351 AA167268 Hs.62349 Human ras Inhibitor mRNA; 3' end 9.3 

129366 H18027 Hs.1 84697 plexlnd 18.2 

129383 W92984 Hs.288224 ESTs 5.9 

129388 AA151621 Hs.1 10964 ESTs 4.1 

129391 T80814 Hs.11101 discs; large (DrosophBa) homoiog 3 (neur 105 

45 129404 AA172056 Hs.317584 ESTs 5.3 

129406 N23707 Hs.1 11 138 K1AA0712 gene product 4 

129426 AA412087 Hs.111323 EST; Highly similar to proteai inhibitor o 8 

129453 AA421213 Hs.111632 Lsm3 protein 5.5 

129513 C00225 Hs.306163 ESTs; Weakly similar to fos39554 J [H.s 5.5 

50 129519 AA298786 Hs.1 12242 ESTs 6,8 

129606 R21443 Hs.166254 heatshock90kD protein 1; alpha 5 

129622 AA278243 Hs.323949 ESTs 6.8 

129626 AA447410 Hs.111334 ESTs; Weakly simOar to ilii ALU SUBFA 5.1 

129627 AA258308 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564 5.3 
55 129628 U26727 Hs.1 174 cyc&Miependent kinase Inhibitor 2A (mel 8.2 

129642 R50008 Hs.1 1806 7-dehydrocholesterol reductase 4.3 

129663 AA442768 Hs.1 1866 translocase of Inner niftochondrial membr 4.4 

129665 M86458 Hs.118776 KDEL (Lys-Asp-Glu-Leu) endoplasmic re 4 

129691 X06700 Hs.119571 collagen; type III; alpha 1 (Ehlers-Danios 6 

60 129783 AA454618 Hs.12479 associated molecule with the SH3 domain 6.4 

129800 AA252436 Hs.1 2540 lysophosphoBpase I 7.7 

129836 AA452161 Hs.206521 YME1 (S.cerevblae)-nke 1 5 

129850 N20593 Hs.288932 GDP dissociation inhibitor 2 6.9 

129869 AA102520 Hs.13015 ESTs; Weakly similar to heat shock prote 5 

65 129896 AA043021 Hs.13225 UDP-GaU)etaGlcNAcbeta1;4-galactosy 6.6 

129982 M87789 immunogbbulln gamma 3 (Gm marker) 4 

129985 AA450045 Hs.1 4 0452 cargo setecfion protein {mannose 6 phosp 5.8 
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Hs.1504 
Hs.21635 



Hs.8768 
Hs.155168 
Hs.166071 
Hs.155291 



Hs.241392 
Hs.155485 
Hs.178710 
Hs.155637 
Hs.155956 
Hs.180446 



130029 AA236412 Hs.236510 
130033 M90696 Hs.181301 
130036 AA195260 Hs.125849 
130069 AA055896 Hs.146428 
5 130077 T24055 Hs.91379 
130080 X14850 Hs.147097 
130096 AA223874 Hs.197955 
130114 AA234717 Hs.14992 
130125 M36803 

10 130135 M61764 

130170 AA610070 Hs.151469 
130189 D43947 Hs,151761 
130208 AA620556 Hs.15250 
130211 D50840 Hs.23703 

15 130235 X14046 Hs.153053 
130276 S75295 Hs.169149 
130280 L13738 Hs.153937 

130313 AA620323 Hs.154320 

130314 D86967 Hs.154332 
20 130328 M135673 Hs.154668 

130356 X84373 Hs.155017 

130367 Z38501 

130378 T47333 

130384 X66364 
25 130393 013630 

130399 AA449417 Hs.155356 

130407 N29888 Hs.155410 

130414 M21121 

130417 U58522 
30 130421 021260 

130441 U35835 

130455 X17059 

130498 138951 

130499 M416723 Hs.158286 
35 130511 L32137 Hs.1584 

130553 M430032 Hs.252587 
130558 K96654 Hs.15984 
130568 AA232535 Hs.16085 
130583 W24957 Hs.293907 

40 130585 H66211 Ha.16331 
130604 X03635 Hs.1657 
130614 AA132007 Hs.16697 
130619 AA477739 Hs.12532 
130622 AA235247 Hs.16846 

45 1305625 FQ3969 Hs.260720 
130627 L23808 
130629 M60346 
130635 M87503 
130639 059711 

50 130677 H17861 
130681 D82808 
130693 AA487202 Hs.17962 
130703 N63295 Hs.18103 
130706 AA488843 Hs.201673 

55 130712 AA292066 Hs.279762 

130714 X92896 Hs.18212 

130715 T98227 Hs.171952 
130744 AA203527 Hs.18747 
130747 AA471293 Hs.6879 

60 130751 AA435633 Hs.18879 
130796 R39390 Hs.19525 
130800 AA223386 Hs.19574 
130855 AA425439 Hs.143323 
130859 AA287327 Hs.20478 

65 130866 M58028 Hs^055 
130880 014678 Hs.20830 
130891 031891 Hs.20991 



Hs.1695 

Hs.1697 

Hs.1706 

Hs.17132 

Hs.17767 

Hs.17820 



ESTs; Moderately simflarto PFT27 (M.m 
cathepsin S 

ESTs; Moderately similar to tlli ALU SU 

coiiagen; type V; afpha 1 

ribosoma) protein L26 

H2A hlstone family; member X 

WAA0704 protein 

ESTs 

hemopexin 
tubulin; gamma 1 

calcium/calmodulin-dependent serine pro 
KIAA0100 gene product 
peroxisomal D3.t)2-enoyl-CoA isomerase 
U DP -glucose ceramide glucosyftransferas 
C037 antigen 

karyopnerin alpha 1 (importin alpha 5) 
activated p21cdc42Hs kinase 
ubiquitin^ctivating enzyme E1C (homolo 
KIAA0212 gene product 
WAA0391 gene product 
nuclear receptor interacting protein 1 
ESTs; Weakly similar to (III ALU SUBFA 
TATA box binding protein fTBP^associa 



KIAA0005gene product 
Homo sapiens mRNA for putative glucosy 
ESTs 

small inducible cytokine A5 (RANTES) 
huntngHnnnteracting protein 2 
clathrin; heavy polypeptide-like 2 
protein kinase; ONA-acttvated; catalytic p 
N-acetyltransferase 1 (aryiamine Nicety) 
karyopnerin (importin) beta 1 
Homo sapiens mRNA for KIAA0446 prot 
cartilage oSgomeric matrix protein (pseud 
pituitary tomortransformlng 1 
ESTs; Weakly similar to gene pp21 protei 
ESTs; HigMy similar to CGM3 protein |H 
ESTs; Moderately similar to similar to C.e 
ESTs 

estrogen receptor 1 

ESTs 

ESTs 

ESTs; Weakly similar to cytochrome P45 
matrix metaHoproteinase 2 (getatinase A; 



ATPase; H+ transporting; lysosomal (vacu 
interferon-stimulated transcription factor 
ESTs 
ESTs 

Rhc-associated; coHed-coi containing pro 

ESTs 

ESTs 

comichon-llke 
adenylate cyclase 7 

ONA segment on chromosome X (unique) 
occludin 

POP7 (processing of precursor; & cere vis 
ESTs 

Homo sapiens clone 23965 mRNA sequen 
ESTs 

ESTs; Weakly similar to katanin p80 subu 
putative DrM/chromatin binding motif 
ceroid-lipofuscinosis; neuronal 2; late infa 
ublquitin-activating enzyme E1 (A1S9T a 
klnesin-Cke 2 
SET domain; bifurcated; 1 

231 



5.6 
5.4 
7.4 
7.6 
4 

^z^ 

5 

7.8 
72 
5.6 
7.5 
6.4 
6.4 
4.5 
9.1 
8.6 
5 

6,1 

10 

6.1 

10.6 

8.3 

7.1 

5.6 

4,1 

4.6 

7 

4.1 
7.9 
4 

6.8 

26.4 

4.8 

6.1 

8.3 

7.5 

5.6 

4 

13.3 

10.1 

39.9 

5.1 

5.9 

4.1 

8.3 

10.3 

7 

5.5 
7.2 
13.5 
6 

6.1 
4.3 
4 

5.1 
8.4 
5.7 
6.2 
8.2 
8.3 
4.5 
7.7 
4.3 
9.8 
4.3 
4.5 
4 
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130905 AA056489 Hs.129998 ESTs 8.7 

130913 W03592 Hs.21198 traoslocase of outer mitochondrial membr 203 

130919 AA291710 Hs.21276 collagen; type IV; alpha 3 (Goodpasture a 9 

130921 AA074S96 Hs. 194 688 bromodomaln adjacent to zinc finger dom 5.3 

S 130944 M97935 Hs.21486 signal transducer and activator of transcrip 18.8 

130974 X57985 Hs.2178 H2B Wstone famHy; member Q 13.4 

130987 R45698 Hs.21893 ESTs; Weakly similar to cAMP inducible 8.5 

130999 N48963 Hs^1992 WAAQ689 protein 72 

131010 AA435748 Hs.169341 ESTs; WeaWy similar to phosphafidic acid 5.2 

10 131046 X02530 Hs^248 smaQ inducible cytokine subfamily 8 (Cy 10.1 

131091 T35341 Hs.22880 ESTs; Highly similar to dipeptidyt pepfid 6.3 

131153 H11760 Hs.23606 ESTs 7.3 

131185 M25753 Hs.23960 cyclinBI 6.2 

131200 AA609427 Hs.293732 ESTs; Moderately similar to !!!! ALU SU 4.3 

15 131208 AA044078 Hs.24210 ESTs 5.5 

131210 AA430047 Hs.95549 ESTs 7.1 

131227 AA429472 Hs.236522 OKFZP434P106 protein 5.6 

131244 038076 Hs.24763 RAN binding protein 1 5.5 

131245 AA620599 Hs.24766 DKFZP564E1 962 protein 6.7 
20 131257 AA256042 Hs.24908 ESTs 5.8 

131319 U25997 Hs.25590 stanntocaicin 8.9 

131339 'AA463450 H&25812 Nflmegen breakage syndrome 1 (ntorin) 6.5 

131388 R34531 Hs.92200 KIAA0480 gene product 9.2 

131410 H84658 Hs.279836 ESTs 12.1 

25 131472 AA608962 Hs.27258 calcycDn binding protein 18.1 

131475 Z39053 Hs.27263 ESTs 7.5 

131501 AA121127 Hs.8207 H3 histoid family 3A 5.5 

131514 X02152 Hs.2795 lactate dehydrogenase A 5.1 

131524 N39152 Hs.301804 ESTs 4.3 

30 131528 060856 Hs.28309 UDP-glucose dehydrogenase 6.4 

131544 N33236 Hs.28555 ESTs; Weakly similar to 60511.8 [C.eleg 5.6 

131557 D30946 Hs^8707 signal sequence receptor; gamma (transbc 8.7 

131562 U90551 Hs28777 H2A histone family; member L 18.8 

131564 AA491465 Hs.28792 ESTs 11.8 

3 5 131586 AA235385 Hs.26966 ESTs; Moderately similar to alternatively 4.7 

131587 M15182 Hs.183868 glucuronidase: beta 52 

131589 U52100 Hs.29191 epiMal membrane protein 2 4.4 

131615 D14533 Hs.192803 xeroderma pigmentosum; complementatio 4.6 

131664 AA136126 Hs.30327 mitogen-activated protein kinase-activated 4.3 

40 131679 AA136660 Hs.30579 ESTs 9.4 

131684 U26174 Hs.3066 granzyme K (serine protease; granzyme 3 9.7 

131687 L11066 Hs.3069 heat shock 70kD protein 9B {mortaRn-2) 6.2 

131689 AA599653 Hs.30S96 transcription factor-like 5 (basic helix-loo 8.3 

131693 W60913 Hs.110796 ESTs;WeaWy similar to cDNA EST yk45 9 

45 131710 AA233225 Hs.30985 MRS1 protein 5.2 

131716 D49738 Hs.31053 cytoskeleton-assoctated protein 1 6.6 

131742 031352 Hs^1433 ESTs 11 

131762 H46831 Hs.107767 ESTs; Moderately similar to CaM-KJl inn 4.9 

131781 AA460450 Hs^1989 DKFZP586G1722 protein 9.2 

50 131795 N32724 Hs.32317 Sox-like transcrip6ona! factor 4.5 

131809 L76517 Hs.3260 presenffin 1 (Alzheimer disease 3 J 5.4 

131814 AA437226 Hs.157 fnterieukin 10 receptor, alpha 4 

131838 AA091932 Hs.180628 dynamin-l&e protein 6.7 

131877 J04088 Hs.156346 toposomerase (DNA) II alpha (170kD) 5 

55 131885 AA044095 Hs.3402 ESTs 11.1 

131891 AA158258 Hs.30376 heterogeneous nuclear protein similar tor 5.6 

131925 AA248470 Hs.183180 ESTs; Weakly similar to RING finger pro 4.5 

131930 AA205460 Hs.69476 ESTs 14.3 

131941 D62657 Hs.35086 ublquitin-spedfic protease 1 6.2 

60 131965 W90146 Hs.35962 ESTs 6.3 

131970 086960 Hs.3610 KIAA0205 gene product 4.2 

131971 R70167 Hs.154938 ESTs 4.3 
131974 AA410424 Hs368122 Homo sapiens mRNA; cONA OKFZp586 4.6 
131977 F09788 Hs3622 procollagen-proline; 2-oxogWarate 4-dtox 6.4 

65 131994 AA479515 Hs.279882 Human DNA sequence from done 703H1 12 

131997 D82399 Hs.136644 Homo sapiens done 23714 mRNA sequen 10 

132017 W67251 Hs.267659 Homo sapiens vav 3 oncogene (VAV3)m 4.7 
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132021 T68246 Hs.306079 chaperonin containing TCP1;subunlt 5 (e 5.2 

132065 D82226 Hs.211594 proteasome (prosome; macropain) 26S sub 8.5 

132085 D44466 Hs.3887 proteasome (prosome; macropain) 26S sub 135 

132089 AA131971 Hs.39122 ESTs 4.8 

5 132109 AA599801 Hs.40098 ESTs 6.2 

132143 AA257056 Hs.7972 WAA0871 protein 14.6 

132149 T10822 Hs.324743 ESTs 5.3 

132153 N90141 Hs.41066 ESTs; Moderately similar to ELONGATI 9.2 

132160 AA281770 Hs.295923 seven in absentia (DrosophUa) homolog 1 5.5 

10 132164 U84573 Hs.41270 procoflagen-lyshe; 2-oxogiutarate 5-dioxy 8.1 

132180 AA405569 Hs.416 fibroblast activation protein; alpha; sepras 15.4 

132183 L19183 Hs.199695 hypothetical protein 122 

132225 AA128980 ESTs 5.6 

132227 AA412620 Hs.4248 ESTs 67 

15 132235 F09058 Hs.42656 ESTs 6.2 

132256 AA608856 Hs.431 murine leukemia viral (bmi-1) oncogene h 6 

132298 N41849 Hs.7120 Homo sapiens cytokine receptor related p 5.6 

132314 AA285290 Hs.44499 small EDRK-rich factor 2 6.8 

132325 N37065 Hs.44856 ESTs 4.7 

20 132384 AA479933 Hs.46967 Human ONA sequence from done 167A1 4.2 

132387 R70914 Hs.281434 heat shock 70kD protein 1 9.1 

132393 W85888 Hs.47334 . ESTs; Moderately similar to Hll ALU SU 4 

132406 F09979 Hs.4774 ESTs 15 

132407 AA431459 Hs.47783 ESTs 8 
25 132413 AA132969 Hs.260116 K1AA1 104 protein 4 

132446 AA426218 Hs.46764 ESTs 5.3 

132465 AA047896 Hs.49169 ESTs 15.4 

132482 AA429478 Hs.238126 ESTs; Highly similar to CGI-49 protein [H 9 

132492 T03749 Hs.4990 KIAA1089 protein 8.5 

30 132528 AA283006 Hs.50758 cr^rnosorna-associated polypeptide C 4.3 

132540 AA488987 Hs.5097 synaptogyrin 2 9.8 

132543 AA417152 Hs.5101 protein regulator of cytokinesis 1 10.1 

132580 137042 Hs.283738 casein kinase 1; alpha 1 5.9 

132586 AA412452 Hs.52515 DKFZP434N024 protein 4.2 

35 132608 AA199588 Hs.5321 ARP3 (acrjrwelated protein 3; yeast) horn 4.2 

132616 AA386264 Hs.283558 isocitrate dehydrogenase 2 (NADP+); mit 5.2 

132617 AA171913 Hs.5338 carbonic anhydrase XII 10.1 
'132618 AA253330 Hs.279916 adaptor-related protein complex 1; gamma 4.8 
132640 U33821 Hs.5437 Taxi (human T-cell leukemia virus type i 5.7 

40 132668 AA453614 Hs.5460 KIAA0776 protein 4.4 

132694 M6Q830 Hs.5509 ecotroplc viral integration site 2B 15.6 

132700 N47109 Hs£521 ESfs 7 

132724 AA417962 Hs.55498 geranytgeranyl diphosphate synthase 1 5.6 

132738 W42674 Hs.264636 ESTs; Moderately similar to neuronal thre 4.9 

45 132742 AA490862 Hs.292812 ESTs; Weakly similar to C43H8.1fC.eleg 7.9 

132744 X54326 Hs.55921 glutamyHworyl-tRNA synthetase 4.1 

132795 H99152 Hs.57079 ESTs 8 

132807 AA331777 Hs.57301 mutL (E. coO) homotog 1 (colon cancer; n 8 

132811 U25435 Hs.57419 transcriptional repressor 4 

50 132817 AB004884 Hs.57553 tousleoMike kinase 2 6.5 

132840 N23817 Hs.5807 Homo sapiens done 23675 mRNAsequen 5.6 

132845 D62588 Hs.5813 ESTs 12.4 

132847 T48195 Hs.58189 eukaryofjc translaffon initiation factor 3; s 7 

132856 W79865 Hs.58367 g!ypican4 6.2 

55 132869 N26855 Hs.203961 ESTs 6.5 

132874 AA425776 Hs.58609 ESTs 5.6 

132880 AA444369 Hs.177537 ESTs 7.2 

132894 082422 Hs.5944 ESTs 7.5 

132900 N56451 Hs.5978 UM domain only 7 4.4 

60 132903 AA2354Q4 Hs.5985 Homo sapiens clone 25186 mRNAsequen 9.1 

132904 X83618 Hs-59889 34iydroxy-3^thylgUjtaryMk»enzymeA 10.7 

132906 AA142857 Hs.234896 ESTs; Highly similar to gemintn (H.saple 10.2 

132914 AA496037 Hs.60293 ESTs 4.7 

132918 AA252605 Hs.6051 K1AA0616 protein 7.1 

65 132936 AB002305 Hs.6111 K1AA0307 gene product 8,3 

132951 U04209 Hs.61418 microfibriBar-associated protein 1 4.3 

132957 AA234791 Hs.61469 Human gene from PAC 753P9; chromoso 13.2 
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132959 AA028103 Hs.61472 ESTs; Weakly sknUar to unknown (S.cere 

132968 N77151 Hs.61638 myosin X 

132984 H80409 Hs.62112 zinc linger protein 207 

132990 AA458761 Hs.18387 transcription factor AP-2 alpha (activating 

5 132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose 

132998 Y00062 Hs.170121 protehtyroshe phosphatase; receptor ryp 

133002 AF006082 Hs.42915 ARP2 (actfrwelated protein 2; yeasQ hom 

133005 C21400 Hs.276605 KIAA0970 protein 

133015 AA047036 Hs.246315 ESTs 

10 133016 W81298 Hs.6269 growtti factor receptor-bound protein 2 

133039 X62055 Hs.63489 protein tyrosine phosphatase; non-recepto 

133050 S67325 Hs.63788 proptonyl Coenzyme A carboxylase; beta 

133056 AA071387 Hs.6396 jumping translocation breakpoint 

133062 R33663 Hs.64056 ESTs 

15 133083 N70633 Hs.6456 chaperonin containing TCP1; subunit 2 (b 

133091 M122147 Hs.64691 KIAA0483 protein 

133093 AA598749 Hs.285996 ESTs 

133124 AA156049 Hs.267923 ESTs 

133126 D16469 Hs.6551 ATPase; H+ transporting; rysosornal (vacu 

20 133196 R37367 Hs.6727 Ras-GTPase activating protein SH3 doma 

133214 Y10659 Hs.285115 Interteukln 13 recepton alpha 1 

133225 241415 Hs.6823 ESTs; Weakly similar to intrinsic fador-8 

133228 N90029 Hs.6831 Homo sapiens done 1400 unknown prate 

133239 AA059405 Hs.1 79882 Homo sapiens done 24655 mRNA sequen 

25 133240 D31161 Hs.242894 ESTs 

133257 AF006086 Hs.6895 actin related protein 2/3 complex; subunit 

133264 W72187 Hs.69192 ESTs; WeaWy similar b cONA EST yk37 

133274 AA488886 Hs.6949 ESTs 

133281 AA421079 Hs.69594 ESTs; Weakly similar to Sox-tike transcri 

30 133283 AA410507 Hs.6968 ESTs 

133287 L15702 Hs.69771 Breton properdin 

133294 R79723 Hs.69997 zinc finger protein 238 

133297 AA600057 Hs.70266 WAA0905 proteto 

133318 AA256168 Hs.152316 ESTs 

35 133362 H06195 Hs.7194 ESTs; Highly similar to CGI-59 protein [H 

133370 AA156897 Hs.72157 DKFZP564I1922 protein 

133391 X57579 Hs.727 inhibin; beta A (acfMn A; actMn AB alp 

133395 AA491296 Hs.72805 ESTs 

133422 N79516 Hs.73287 ESTs; Weakly similar to eyelid [D.melano 

40 133431 AA255438 Hs.7358 Homo sapiens mRNA; cDNA DKFZp566 

133435 T23983 Hs.323966 ESTs 

133449 AA094989 Hs.7381 voltage-dependent anion channel 3 

133468 X03068 Hs.73931 major hrstocompatibifity complex; class il 

133484 X78710 Hs.211581 metakegulatory transcription factor 1 

45 133506 AA316868 Hs.74346 ESTs; Weakly similar to 140G11.hp.me 

133517 X52947 Hs.74471 gap Junction protein; alpha 1;43kO (conn 

133551 D63480 Hs.278634 WAA0146 protein 

133569 AA313977 Hs.172772 transcription elongation factor B (SHI}; po 

133572 W94333 Hs.279915 translocase of inner mitochondrial membr 

50 133577 F03717 Hs.75063 human immunodeftdency virus type I enh 

133589 L37368 Hs.75104 RNA-binding protein S1; serine-rich dom 

133608 013315 Hs.75207 glyoxalasel 

133617 AA148318 Hs.75249 K1AA0069 protein 

133627 U09587 Hs.75280 glycyURNA synthetase 

55 133633 D21262 Hs.75337 nudeo1arphosphoproteBip130 

133634 U24166 Hs.234279 mlcrotubule-assodated protein; RP/EB fa 

133640 083004 Hs.75355 ubiqulfin-conjugating enzyme E2N (homo 

133644 089077 Hs.75367 Src^adapter 

133649 AA479139 Hs75393 add phosphatase 1; soluble 

60 133652 AA287383 Hs.7540 ESTs 

133674 AA458946 Hs.75497 ESTs 

133700 K01396 Hs397681 protease inhibitor 1 (an^elastase); aipha- 

133705 N21648 Hs.75659 MpV17 transgene; murine tornolog; giom 

133716 Y00282 Hs.75722 ribophortnll 

65 133720 127841 Hs.75737 pertcontrtolarmateriaJI 

133752 U49278 Hs.75875 ubiquifin-conjugaflng enzyme E2 variant 

133765 021255 Hs.75929 cadherin 1 1 (OB^adhsrin; osteoblast) 
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133772 W73693 Hs.76038 IsopentenyWiphosphatedellalsbmenase 7.9 
133774 Z23090 Hs.76067 heat shock 27kD protein 1 4.1 
133776 J03473 Hs.177766 ADP-ribosyltransferase(NAD^pory{AD 13 
133784 AA214305 Hs.301064 ESTs 5.2 
5 133814 M33882 Hs.76391 myxovlrus (influenza) resistance 1;homol 11.7 
133829 M453783 Hs.76550 Homo sapiens mRNA; cONA DKFZp564 9.4 
133834 AA147510 Hs.28866Q serine protease; umbilical endotheiium 4.8 
133839 M59815 Hs.170250 complement component 4A 6.7 
133842 U73477 Hs.285013 putative human HU class li associated p 7.1 
10 133845 T68510 Hs,76704 ESTs 6.3 
133859 U86782 Hs.178761 26S proteasome-assocfated pad1 homobg 13.7 

133867 043948 Hs.76989 WAA0097 gene product 4.1 

133868 U58090 Hs.183874 cuIiin4A 4 
133871 AA454597 Hs.182793 ESTs 4.7 

15 133893 X01060 Hs.77356 transferrin receptor (p90;CD71) 8.3 
133914 N32811 Hs.77542 ESTs 5 
133918 W72783 Hs.58382 ESTs; Weakly similar to C13F10.5 [Cele 4.5 
133944 AA045870 Hs.7780 Homo sapiens mRNA; cONA DKFZp564 6.3 

133946 AA156565 Hs.173878 4-nftrophentfphosptatase domain and non 6.4 

20 133963 134587 Hs.184693 transcription elongation factor B (Sill); po 6.3 

133980 D00760 Hs.250811 proteasome (prosome; macropaln) subunit 11.9 

133990 C02374 Hs.7822 Homo sapiens mRNA; cDNA OKFZp564 8.2 

133999 M28213 Hs.78305 RAB2; member RAS oncogene family 5.2 

134030 J03077 Hs.78575 prosaposin (variant Gaucher disease and v 4.6 

25 134032 Z81326 Hs.78589 protease inhibitor 12 (neuroserpin) 6.5 

134045 S82470 Hs.78768 681 11.9 

134046 D28473 Hs.172801 isoleticine-tRNA synthetase 52 
134064 087685 Hs.78893 K1AA0244 protein 7.3 
134070 H98621 Hs.78946 cullin 3 4.7 

30 134087 U51166 Hs.173824 thymine-DMA glycosylase 7 

134090 M22382 Hs.79037 heat shock 60kD protein 1 (chaperonin) 4.5 

134098 X05323 Hs.79086 ribosomal protein; mitochondrial; 13 9.4 

134110 U41060 Hs.79136 UV-1 protein; estrogen regulated 4.4 

134132 U32519 Hs.220689 Ras-GTPase-activating protein SH34oma 6.6 

35 134168 AA398908 Hs.181634 Human Chromosome 16 BAC clone CIT9 8.6 

134170 M63138 Hs.79572 cathepsin D (lysosomal aspartyl protease) 9.3 

134208 U88871 Hs.79993 peroxisomal biogenesis factor 7 6.3 

134258 L28010 Hs,808 heterogeneous nuclear ribonucteoprotein F 4.3 

134288 AA430008 Hs.8117 ESTs 6.9 

40 134310 AA313414 Hs.8148 Homo sapiens clone 24856 mRNA sequen 7.4 

134326 U16306 Hs.81800 chondroifin sulfate proteoglycan 2 (versic 6.1 

134329 D38551 Hs.81848 ' RAD21 (S. pombe) homoiog 8.6 

134331 AA452020 Hs.111222 ESTs; Weakly similar to CG1-128 protein 6.1 

134351 R82074 Hs.82109 syndecanl 4.4 

45 134357 L43575 Hs.82171 Human clone 191B7 placenta expressed m 6.6 

134363 M37033 Hs.82212 CD53 antigen 5.3 

134367 X54199 Hs.82285 phosphoribosylglydnainkleformyltransfe 4.8 

134374 D62633 Hs.8236 ESTs 152 

134375 AA412720 Hs.82389 ESTs; Highly similar to CGM18 protein 12 
50 134376 X02874 Hs.82396 2 , ;5 , -ofigoadenylate synthetase 1 6.4 

134381 U56637 Hs.184270 capping protein (actin filament) muscle Z- 4 

134388 -M15841 Hs.82575 small nuclear ribonucteoprotein polypepti 57 

134395 L09717 Hs.8262 rysosomal-associated membrane protein 2 6.9 

134399 H99801 Hs.82689 tumor rejection antigen (gp96) 1 4.5 

55 134401 AA243746 Hs.211577 Wnedn 1 (Wnesto receptor) 112 

134405 J04177 Hs.82772 collagen; type XI; alpha 1 15.3 

134415 AA329274 Hs.82911 protein tyrosine phosphatase type IVA; m 4.1 

134417 087969 Hs.82921 solute carrier family 35 (CMP-sialic acid t 4.2 

134419 L08044 Hs.82961 trefoil factor 3 (intesfinaQ 5.9 

60 134421 AA122386 Hs.82985 collagen; type V; alpha 2 58 

134423 W96151 Hs.83006 ESTs; Highly similar to CGM39 protein 44 

134438 AA449984 Hs.246857 ESTs; Highly similar to prote&ie kinase JN 7 

134446 T25732 Hs.83419 KIAA0252 protein 46 

134453 X70683 Hs.83484 SRY (sex determining region Y)-box 4 5.1 

65 134470 X54942 Hs.83758 CX28 protein kinase 2 20.3 

134487 R38185 Hs.83954 Homo sapiens unknown mRNA 5 

134495 D63477 Hs.84087 WAA0143 protein 16.1 
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Hs.284226 

Hs.89603 

Hs.89718 

Hs.89751 

Hs.89981 

Hs.90061 

Hs.90280 

Ks.904 

Hs.90419 

Hs.9071 

Hs.92308 



134498 M63180 Hs.84131 

134506 U45328 Hs.84285 

134529 H24460 Hs.848 

134570 U66615 Hs.172280 

5 134582 AA234966 Hs.86041 

134600 R68884 Hs.86347 

134623 X74496 Hs.86978 

134654 W23625 Hs.8739 

134655 M454070 Hs. 123090 
10 134675 AA250745 Hs.87773 

134711 X04011 Hs.88974 

134714 U89922 Hs.890 
134722 W47183 
134776 J05582 

15 134806 Z49099 
134810 M27394 
134840 U51477 
134843 H60595 
134853 D82348 

20 134866 U84011 
134868 Z39762 
134885 N27670 
134982 N46066 

134969 AA236324 Hs.92381 

25 134992 H05625 Hs.5831 

134993 AA282343 Hs. 301005 

135010 D59675 Hs.92927 

135015 U54999 Hs.278338 
135029 AA224180 

30 135032 AA243497 Hs.173685 

135037 U77948 Hs.278589 

135059 AA598449 Hs.93832 

135071 L08069 Hs.94 

135083 AA495950 Hs.94262 

35 135117 W52493 Hs,94694 

135144 AA044842 Hs.95260 

135154 AA126433 Hs.267812 

135218 031157 Hs.324277 

135237 AA454930 Hs.9691 

40 135243 AA215333 Hs.97101 

135335 H20989 Hs.198281 

135349 083174 Hs.9930 

135367 AAW1Q9 Hs.9963 

135389 U05237 Hs.99872 

45 135400 M23263 
135411 L10333 

300019 mms 

300021 M97935 

300022 M97935 
50 300089 AI199738 

300107 AI694585 
300254 AW079607 Hs.188417 
300328 AW015860 Hs.224623 
300549 AA699328 Hs.298119 

55 300711 A1492179 Hs.166244 
300921 AW293224 Hs.232165 
301124 T79326 Hs.298262 
301165 N85789 Hs.150186 
301576 AI682905 Hs.270431 

60 301604 AA373124 Hs.24809 
301704 AA526313 Hs.293691 
301782 N99399 Hs.143046 
301884 AA312082 Hs. 105445 
301936 NM_004694Hs.114924 

65 302002 AF013956 Hs.5637 
302032 NM.001992HS.128087 
302067 H05698 Hs.222399 



Hs.99915 
Hs.99947 



Hs^Q8275 
Hs-270464 



threonyMRNA synthetase 


6.1 


ubiquitin-conJugaOng enzyme E2I (homo) 


4.6 


fKSQ^blntSng protein 4 (5?H>) 


6.2 


SWVSNF related; matrix associated; adin 


4.8 


COG triptet repeat binding protein 1 


4.7 


ESTs; Weakly similar to predicted using G 


5.8 


pToiyfendbpepSdase 


4.5 


ESTs; Weakly similar to ORF YGR200c ( 


13.7 


ESTs 


5.8 


protein kinase; cAMP-dependent; catalyti 


8.9 


cytochrome b-245; beta polypeptide (chro 


6.8 


lymphotoxin beta (TKf superfarnily; mem 


35.7 


ESTs; Weakly similar to neural F box pro 


8.1 


mucin 1 ; transmembrane 


6.2 


spermine synthase 
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7 


diacylgiycero) kinase; zeta (104kD) 


4.1 


progesterone binding protein 


4.7 


5^aminoimIaazoM<anboxamW 


10.2 


amylo-1;6-glucostdase; 4-atpha-giucanotr 


12.1 


WAA0882 protein 


6 


progesterone membrane binding protein 


5 


ESTs 


4.1 


Homo sapiens mRNA; chromosome 1 spe 


16.8 


ESTs 


4 


purine-rich element binding protein B 


4.4 


ESTs 


7 


LGN protein 


4.8 


ESTs; Moderately similar to 17-beta-hydr 


13.6 


Human DMA sequence from done 30M3 


4 


general transcription factor II; 1 


8 


Homo sapiens clone 24483 unknown mRN 


5.4 


heat shock protein; DNAJ-fike 2 


9.3 


ESTs 


6.7 


Homo sapiens done 24837 mRNA sequen 


102 


Homo sapiens mRNA; cONA OKFZp586 


6.6 


sorting nextn 4 


7.4 


ESTs; Weakly similar to growth factor-res 


6.2 


ESTs 


19.5 


putative G protein-coupled receptor 


8.8 


pyruvate kinase; musde 


12.4 


coiiagen-binding protein 2 (coDtgen 2) 


5.5 


TYRO protein tyrosine kinase binding pro 


5.4 


fetal Alzheimer antigen 


7.8 


androgen receptor (dtydrotestosterone re 


9.1 


reficutonl 


5.3 


AFFX control; STAT1 


8.3 


AFFXoontrofcSTATI 


7 


AFFX control: STAT1 


14 


ESTs; Weakly similar to 1111 ALU CLASS 


9.1 


P*?T«* WoaUu ctrtitlar in fill A 1 II Ci AC<5 

co is, weaiuy similar \o uti alu uuaoo 


7 A 


ESTs; WeaMy similar to ZnT-3 [H^apien 


30.1 


ESTs 


115 


ESTs 


5.5 


ESTs; Weakly similar to cONA EST yk40 


11 


ESTs 


11 


ESTs; Weakly similar to dJ88J8.1 [H^api 


8.8 


ESTs; Weakly similar to PTERJN-4-ALP 


6 


ESTs; Weakly similar to IIII ALU SUBFA 


. 4.7 


ESTs;WeakryslmiIartoC17G10.1 (Cele 


8 


ESTs 


4.2 


EST duster (not in UniGene) with exon h 


18 


GDNF family receptor alpha 1 


20.7 


EST cluster (not in UniGene} with exon h 


11.6 


chroma box homotog 4 (DrosophQa Pc da 


9.2 


EST cluster (not in UniGene) with exon h 


4.3 


ESTs; Weakly similar to protetn-tyrosine 


7.8 
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302145 NNL003613HS.151407 EST duster (not in UniGene) with exon h 15.1 
302236 A/128606 Hs.6557 zinc finger protein 161 25.6 
302276 NM.004448HS.323910 EST cluster (not In UniGene) with exon h 21.6 
302290 AL117607 Hs.175563 Homo sapiens mRNA;cONAOKFZp564 41.4 
5 302326 NMJJ04271HS.184018 EST duster (not in UniGene) with exon h 8.9 
302342 AB023141 Hs.190386 WAA0924 protein 5.4 
302372 AH 17406 Hs.200102 Homo sapiens rnRNA; cDNA DKFZp434 8.9 
302422 AB021227 Hs.3743 matrix metailoproteinase 24 (meinbrane-in 5.2 
302431 AF129530 Hs.226434 EST duster (not in UniGene) with exon h 5.3 

10 302501 AF022726 Hs.251446 EST cluster (not In UniGene) with exon h 9.9 

302505 AL049650 Hs.247874 multiple UniGene matches 4.3 

302533 L36149 Hs.248116 chemokine (C motif) XC receptor 1 4,9 

302638 AA463798 Hs.102696 ESTs; Weakly similar to C11D2.4[C.eieg 5.3 

302656 AW293005 Hs.70704 ESTs 84 

15 302792 AA343696 Hs.46821 ESTs; Weakly similar to putative [H.sapie 4,5 

302820 X04588 Hs.85844 EST duster (not in UniGene) with exon h 6.8 

302838 U66049 Hs.82171 EST duster (not in UniGene) with exon h 8.4 

302892 N58545 Hs.42346 hlstone deacetytase 3 228 

302977 AW263124 Hs.315111 EST duster (not In UniGene) with exon h 6,8 

20 302989 N46406 Hs.84700 EST duster (not in UniGene) with exon h 8.9 

303007 AA478876 Hs.317714 pallid (mouse) homolog; paiOdin 10.1 

303052 AF140242 Hs.279926 EST duster (not in UniGene) with exon h 24.4 

303131 AW081061 Hs.103180 actin-fike6 63 

303132 AI929819 Hs.4055 ESTs 17.7 
25 303153 U09759 Hs.246857 mttogen-activated protein kinase 9 11.4 

303387 AA908797 Hs.180799 ESTs 15,8 

303499 AI815990 Hs.293515 ESTs 7.2 

303502 AA488528 EST duster (not in UniGene) with exon h 5.3 

303576 T07216 Hs.301226 EST duster (not in UniGene) with exon h 16.2 

30 303620 AA397546 Hs.119151 ESTs 89 

303634 AI953377 Hs.28444 ESTs; Weakly similar to predicted using G 12 

303642 AW299459 Hs.111977 EST duster (not In UniGene) with exon h 4.2 

303654 AA436942 Hs.288529 ESTs 8,4 

303733 AW502498 Hs.15220 ESTs; Weakly similar to zinc finger prate 5.2 

35 303780 AW24014 Hs.18995 ESTs; Moderately similar to K1AA0456 p 28.4 

303792 C75094 Hs.199839 ESTs; Highly slmSar to NG22 [H.saptens 4.4 

303842 A1337304 Hs.126268 ESTs; Weakly similar to similar to PDZd 8.1 

303951 AW475081 Hs.172928 collagen; type 1; alpha 1 7.5 

304465 AA421948 EST singleton (not in UniGene) with exon 6.5 

40 304507 AA456426 EST 5.4 

304591 AA505702 EST singleton {not in UniGene) with exon 9.8 

304601 AA507875 EST singleton (not in UniGene) with exon 7.5 

304659 AA533185 EST singleton (not in UniGene) with exon 7 

305040 AA630582 Hs.169476 glyceraWehyde^-phosphate dehydrogena 12.4 

45 305134 AA653159 Hs.179661 EST singleton (not in UniGene) with exon 8.7 

305415 AA725116 Hs.78465 EST singleton (not In UniGene) with exon 5.3 

305453 AA738110 EST singleton (not in UniGene) with exon 4.1 

305898 AA872838 keratin 8 7.7 

305913 AA876109 EST singleton (not in UniGene) with exon 6.3 

50 305950 AA884479 EST singleton (not in UniGene) with exon 5.6 

306004 AA889992 Hs.2186 EST singleton (not in UniGene) with exon 13.2 

306009 AA894560 Hs.283370 EST singleton (not in UniGene) with exon 4.4 

306060 AA906161 Hs.76277 EST singleton (not in UniGene) with exon 4.6 

306398 AA970548 Hs.297681 EST singleton (not in UniGene) with exon 7.6 

55 306505 AA987722 Hs.17292B EST singleton (not in UniGene) with exon 19.7 

306576 AA995761 Hs.276092 EST singleton (not in UniGene) with exon 5.5 

307117 AJ184111 Hs.76067 heat shock 27kD protein 1 7.7 

307138 AU85516 Hs.172928 collagen; type i; alpha 1 88 

307187 AI190870 Hs.276417 EST singleton (not in UniGene) with exon 4.1 

60 307542 A1280859 Hs.62954 EST singleton (not in UniGene) with exon 6 

307554 A1281603 Hs.172928 EST singleton (not in UniGene) with exon 10.8 

307806 AI351739 Hs.276726 EST singleton (not in UniGene) with exon 4.7 

308079 AW72733 Hs.270208 ESTs 4 2 

308307 AI581398 Hs.172928 collagen; type 1; alpha 1 54 

65 308511 A1687580 Hs.169476 EST singleton (not in UniGene) with exon 10.1 

308615 A1738593 Hs.101774 EST singleton (not In UniGene) with exon 151 

308677 AI761173 EST singleton (not In UniGene) wfth exon 46 
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308852 AI829848 Hs.182937 pepOdytproIyl isomerase A (cyctophiEn A 5.9 

308974 A1872290 Hs.300697 Immunoglobulin gamma 3 (Gm markei) 4.5 

308981 AI673242 EST singleton (not tn UniGene} with exon 7.6 

308995 AI880172 EST singleton (not In UniGene) wfth exon 6.6 

5 309177 AI951118 EST singleton (not in UnlGene) with exon 243 

309188 AI952723 Hs.90207 EST singleton (not in UruGene) with exon 6.1 

309198 AI955915 major hlstocompaWty complex; class I; 5.6 

309226 AI969897 EST singleton (not In UnlGene) with exon 6.2 

309279 AI990102 EST singleton (not in UnlGene) with exon 7.9 

10 309583 AW170035 EST 64.5 

309824 AW191929 Hs^52989 EST 53 

309629 AW192764 Hs.172928 col/agen; type ft alpha 1 6.9 

309641 AW194230 Hs.253100 EST 11.4 

309698 AW238461 Hs.73742 rttwsomal protein; large; PO 43 

15 309700 AW241170 Hs.179651 Homo sapiens clone 24703 beta-tubutln m 11.9 

310073 AD35004 Hs.148558 ESTs 42 

310094 AW450967 Hs.235240 ESTs 5.7 

310373 AW080778 Hs.145582 ESTs 43 

310438 AW022192 Hs.200197 ESTs 39.1 

20 310470 AI281848 Hs.194691 ESTs 4.9 

310583 AW205632 Hs.211198 ESTs 7 

310877 T47784 Hs.188955 ESTs 4.1 

311067 A1587332 Hs.209115 ESTs 11.2 

311166 A1821294 Hs.118599 ESTs 24.1 

25 311199 T57896 Hs.191095 EST cluster (not in UniGene) 5.7 

311465 AI758660 Hs.206132 ESTs 15.7 

311587 AI828254 HsJ?71019 ESTs 6.4 

311774 AA700870 Hs.14304 ESTs 6.2 

311785 AI056769 Hs.133512 ESTs 5 

30 311923 T60843 .Hs.189679 ESTs 5.9 

311935 AA216387 EST duster (not In UniGene) 5.5 

311972 N51511 Hs.188449 ESTs 5.2 

• 312014 A1435650 Hs.128778 ESTs 4.3 

312047 AA588275 Hs.180669 ESTs 14.7 

35 312147 T89855 Hs.195648 EST cluster (not In UniGene) 9.8 

312153 AA759250 Hs.153028 cytochrome th561 27.1 

312168 T92251 Hs.198882 ESTs 4.2 

312172 A1222168 Hs.191168 ESTs 6.1 

312226 AI796815 Hs.199993 ESTs; Weakly similar to ubiquitous TPR 53 

40 312292 AW451893 Hs.151124 ESTs 18.4 

312312 AI080505 Hs.134529 ESTs 11.9 

312369 AA582039 Hs.173884 Homo sapiens mRNA; cfuomosome 1 spe 4 

312407 R46180 Hs.153485 ESTs 13.6 

312430 AW139117 Hs.117494 ESTs 4.1 

45 312470 AW451347 Hs.175862 ESTs 4.6 

312483 A1417526 Hs.7753 ESTs 153 

312521 AA033609 Hs319093 ESTs 123 

312544 AI498371 Hs.183526 ESTs 14.6 

312638 AW439195 Hs^56880 ESTs 5.3 

50 312754 R99834 Hs.250383 ESTs 8.4 

312772 H63791 EST cluster (not in UnlGene) 43 

312821 AA699325 Hs,269880 ESTs 83 

312837 AW292286 Hs.255058 ESTs 7.1 

312849 AA846353 Hs.194054 ESTs 5.9 

55 312854 AA828713 Hs.321058 EST cluster (not in UniGene) 4,1 

312992 AA088446 Hs.170298 ESTs 73 

313096 AI422367 Hs.163533 ESTs 6.1 

313112 AA732534 Hs.269099 ESTs 42 

_ 313126 AA720887 Hs.283313 EST cluster (not in UniGene) * 18.1 

60 313136 N59284 Hs.288010 ESTs 17 

313197 A1738851 Hs.222487 ESTs 12.9 

313219 N74924 Hs.182099 ESTs 7.1 

313258 AW068358 Hs.183918 ESTs 13.7 

313328 AW449211 Hs.105445 ESTs 273 

65 313352 AW292127 Hs.144758 ESTs 93 

313417 AA741151 Hs.137323 ESTs 82 

313455 AW081702 Hs.98571 ESTs 6.9 
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313590 AA804410 Hs.291677 EST cluster (not in UniGene) 5,3 

313663 A1953261 Hs,1698l3 ESTs 7.6 

313667 U69201 Hs.13684 ESTs; Weakly similar to choline kinase is 12.5 

313749 AW450376 Hs.1 19004 ESTs 5.5 

5 313832 AW271022 Hs.133294 ESTs 4.3 

313881 AA535580 Hs.16331 ESTs 7,7 

313915 AI969390 Hs.163443 ESTs 27.1 

313955 AI858884 Hs.270647 ESTs 57 

313974 AI310151 Hs.173524 ESTs 4.3 

10 314097 AA648744 Hs.269493 ESTs 14.5 

314129 AA228366 Hs.1 15122 ESTs 9.5 

314359 AA205569 Hs,194193 ESTs 5.4 

314384 AA535840 Hs.162203 ESTs; Weakly simBar to altemafively spD 5.3 

314394 AI380563 Hs.130816 ESTs 13.2 

15 314462 AA347951 Hs.326413 ESTs 6.2 

314465 AA602917 Hs.156974 ESTs 18.1 

314470 AI934422 Hs.30661 ESTs ' 4.2 

314488 AA358265 Hs.182890 ESTs 6.1 

314506 AA833655 Hs.206868 ESTs 27.8 

20 314510 AI204418 Hs.190080 ESTs 9.5 

314558 A1873274 Hs.190721 ESTs 22J5 

314661 AA436432 Hs.324239 EST cluster (not In UniGene) 13.3 

314691 AW207206 Hs.136319 ESTs 214 

314754 AW026761 Hs.134374 ESTs 4.4 

25 314775 AI149880 Hs.188809 ESTs 4.4 

314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 18.4 

314961 AW008061 Hs.231994 ESTs 102 

314963 AI689617 Hs.200934 ESTs 5.3 

315006 AI538613 Hs.298241 ESTs 20.7 

30 315010 AA531082 Hs240049 ESTs 5 

315019 AA532807 Hs.105822 ESTs 6.1 

315033 AI493046 Hs.146133 ESTs 12 

315036 AA534953 Hs.163297 ESTs 8.3 

315037 AW205863 Hs.133988 ESTs; Weakly similar to gene MAC25 pr 6.1 
35 315051 AW292425 Hs.163484 EST " 12.7 

315054 AI968598 Hs.78768 ESTs 7.6 

315073 AW452948 Hs.257631 ESTs 13.9 

315080 AA744550 Hs.136345 ESTs 4.4 

315083 AI221325 Hs.205442 ESTs 5.1 

40 315088 AA557351 Hs.152448 ESTs; Moderately similar to MULTIFUN 4.7 

315175 AI025842 Hs.152530 ESTs 11.9 

315195 AA972756 Hs.44898 ESTs 28.8 

315296 AA876905 Hs.125286 ESTs 16.1 

315303 AW194364 Hs.128022 ESTs; Weakly similar to FIG-1 PROTEIN 25.7 

45 315352 AA604799 Hs.136528 ESTs; Moderately similar to Ml ALU SU 12.3 

315364 AA643602 Hs.155485 ESTs; Highly similar to serine protease [H 4.6 

315368 AW291563 Hs.104696 ESTs 4.8 

315390 A1801565 Hs.200113 EST s; Weakly similar to alternatively spU 4.4 

315408 AW273261 Hs.216292 ESTs * 5 

50 315458 AA872000 Hs.116104 ESTs 7.6 

315472 AA828850 Hs.165469 ESTs 4.9 

315478 AA665612 Hs.120874 ESTs 5.2 

315498 AA628539 Hs.116252 ESTs; Moderately similar to ilil ALU SU 4.8 

315527 AI791138 Hs.116768 ESTs 4.4 

55 315530 AI200852 Hs.127780 ESTs . 22.4 

315562 AA737415 Hs.152826 ESTs 5.9 

315634 AA837085 Hs.220585 ESTs 8.8 

315647 AA648983 Hs.212911 ESTs 15 

315652 AI521489 Hs.3053 ESTs 6.3 

60 315676 AW002565 Hs.124660 ESTs 92 

315680 AA814309 Hs.123583 ESTs 8.1 

315735 AI831760 Hs.155111 ESTs 13.4 

315741 AA812168 Hs.122559 ESTs 5.4 

315769 AA744875 Hs.189413 ESTs 4.4 

65 315978 AA830893 Hs.119769 ESTs 104 

315984 AI015862 Hs.131793 ESTs 5 

316042 AW297979 Hs.170698 ESTs 147 
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316136 AA830a08 Hs.124366 ESTs 4 

316177 AJ908272 Hs.293102 EST cluster (not in UniGene) 3Z6 

316313 AA741300 Hs.202599 ESTs 4.8 

316405 AA757900 Hs.270823 ESTs 4.8 

5 316480 AI749921 Hs.205377 ESTs 1Z9 

316564 AI743571 Ks.168799 ESTs; Weakly state to llll ALU SUBFA 8.1 

316714 AA809792 Hs.123307 ESTs 5 

316715 AW40266 Hs.170673 ESTs 4.2 
316828 AA828116 Hs.173076 ESTs 5.2 

10 316869 A1954880 Hs.134604 ESTs 13.3 

316905 AW138241 Hs.210846 ESTs 6.2 

316943 AW014875 Hs.137007 ESTs 5.3 

316949 AA856749 Hs.124620 ESTs 72 

317008 AW051597 Hs.143707 ESTs 4.1 

15 317028 AA962623 Hs.189144 ESTs; WeaMy similar to RENAL SOOIU 4.2 

317067 AI805392 Hs.325335 ESTs 4.5 

317059 AI732892 Hs.190489 . ESTs 6.4 

317210 AA490718 EST duster {not In UniGene) . 4.4 

317298 AI922374 Hs.158549 ESTs 5.9 

20 317658 AW139077 Hs.202217 ESTs 4.6 

317674 AW294909 Hs.132208 ESTs 5.2 

317685 AI798630 Hs.149997 ESTs 4.3 

317836 AA983913 Hs.128929 ESTs 12.4 

317881 AIS272A8 Hs.224398 ESTs 12.1 

25 317902 Ai828602 Hs.211265 ESTs 8.8 

317916 A1565071 Hs.159983 ESTs 12.6 

318042 AW294522 Hs.149991 ESTs 5.6 

318053 A1074465 Hs.133469 ESTs 4 

318064 AW296888 Hs.170939 ESTs 5.2 

30 318070 A1024594 Hs.248942 ESTs 4.7 

318073 AW167087 Hs.131562 ESTs 15.7 

318146 AI040125 Hs.150521 ESTs 5.9 

318186 AW016773 Hs.3709 ESTs 5.3 

318481 AI291584 Hs.145921 ESTs; Weakly similar to HYPOTHETICA 7.6 

35 316566 AI335361 Hs.226376 ESTs 5.8 

318617 AW247252 Hs.75514 nudeoskte phosphorylase 11.1 

318662 AI285898 Hs^94014 ESTs 16.3 

318691 AW192139 Hs.181307 H3 histone; family 3A 4 

318740 NM_002543Hs.77729 EST duster (not in UniGene) 21.3 

40 318744 AI793124 Hs.144479 ESTs 35 

318948 AA317274 Hs.13996 ESTs 11.7 

319163 F15257 Hs.27 glydne dehydrogenase (decarboxylating; 7 

319478 R06841 Hs.270307 EST duster (not In UniGene) 8.9 

319545 R83716 Hs.14355 ESTs 8.2 

45 319668 NM.002731HS.87773 EST duster (not in UniGene) 25.4 

319763 AA460775 Hs.6295 ESTs 7 

319913 AA179304 Hs.271586 ESTs; Moderately similar to Ulf ALU SU 8.7 

319936 W22152 Hs.282929 EST duster (not in UniGene) 5.6 

319951 AA307665 Hs.14559 ESTs 4.9 

50 319962 H06350 Hs.135056 ESTs 9.2 

319977 AA632632 EST duster (not in UniGene) 4.6 

320074 AA321166 Hs.278233 EST duster (not in UniGene) 16.7 

320032 AF022799 Hs.1 13292 caipain 9 (nCL-4) 5.4 

3201Q7 AAS36461 Hs.291712 EST duster (not in UniGene) 5.3 

55 320133 063271 EST duster (not in UniGene) .5.5 

320167 AA984373 Hs.90790 EST duster (not in UniGene) 15 

320187 T99949 Hs.303428 EST cluster (not In UniGene) 6.7 

320211 AL039402 Hs.125783 DEME-6 protein 24.3 

320401 U90449 Ks.152717 nudeoskie diphosphate kinase type 6 (Inh 10 

60 320458 A1884396 Hs.24131 ESTs 5.4 

320488 R31386 Ks.191791 EST duster (not in UniGene) 4.9 

320521 N31464 Hs.24743 ESTs 9.5 

320661 AA864846 Hs.1 15175 EST duster (not in UniGene) 6.6 

320691 R61576 Hs.313951 hypofteticai protein 5.9 

65 320699 R63161 Hs.118249 EST duster (not In UniGene) 4 

320727 U96044 Hs.161125 EST duster (not in UniGene) 45.3 

320993 AL050145 Hs.225986 Homo sapiens mRNA; cDNA DKFZp586 7.2 
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321012 AA737314 Hs.194324 EST duster (not In UniGene) 6.1 

321050 AW393497 EST cluster (not in UniGene) 5 

321051 AF134149 Hs.240395 EST cluster (not in UniGene) 11.4 
321171 AI769410 Hs.221461 ESTs 7.7 

5 321192 AA295304 Hs.297939 ESTs; Weakly similar to neogenin [H.sap 5.5 

321354 AA078493 EST cluster (not In UniGene) 16.9 

321357 H68014 Hs.141276 ESTs; WeaWy similar to III! ALU SUBFA 42 

321412 AW366305 Hs.22891 EST cluster (not in UniGene) 6.3 

321489 AW392474 Hs.172759 ESTs; Moderately similar to till ALU SU 9 

10 321539 N98619 Hs.42915 ARP2 (ac&weteted protein 2; yeasO horn 113 

321593 HB4762 Hs.253197 ESTs 10.4 

321666 028390 Hs.272897 EST duster (not in UniGene) 19.9 

321891 AW157424 Hs.165954 ESTs 5.6 

321910 H67065 Hs.271530 ESTs; Weaty similar to III! ALU SUBFA 5.4 

15 321953 AW068268 Hs.292833 ESTs; WeaWy similar to fill ALU CUSS 6.5 

321978 N77342 Hs.21851 EST duster (not in UniGene) 105 

322017 AA310039 Hs.9192 ESTs 9.8 

322026 AA233527 Hs.283675 low density lipoprotein receptor (familial 27.8 

322035 AL137517 Hs^06201 EST duster (not in UniGene) 40.2 

20 322171 AF085968 Hs.48474 EST cluster (not in UniGene) 5.7 

322175 AF085975 EST duster (not In UniGene) 7.7 

322236 AL134970 Hs.104222 foffefaMke 1 14.4 

322303 W07459 Hs.157601 EST duster (not in UniGene) 13.4 

322735 AA086123 Hs.297856 EST duster (not in UniGene) 7.6 ' 

25 322777 AA679082 Hs.269947 ESTs 4.4 

322818 AW043782 Hs.293616 ESTs 21 

322882 AW248508 Hs.279727 DtGeorge syndrome critical region gene 2 15.3 

322975 C16391 EST duster (not in UniGene) 21.3 

322991 C18965 Hs.159473 ESTs 11.7 

30 323011 AA580288 EST duster (not In UniGene) 8.9 

323091 AW014094 Hs.210761 ESTs 10.8 

323107 AI301107 Hs.150790 ESTs 6.5 

323136 AL120351 Hs.30177 EST duster (not in UniGene) 5.5 

323168 AL120862 Hs.124165 ESTs 17.9 

35 323195 AI064982 Hs.117950 multifunctional polypeptide similar to SA 5.8 

323201 AL049370 Hs.13350 Homo sapiens mRNA; cDMA DKF2p586 11.6 

323203 AA203135 Hs.130186 ESTs 6.4 

323243 W44372 Hs.110771 EST duster (not in UniGene) 7.3 

323244 T70731 Hs.193620 EST cluster (not In UniGene) 15.8 
40 323328 AA228078 Hs.255096 EST duster (not in UniGene) 4.8 

323332 AI829520 Hs.227513 ESTs 202 

323333 AA228883 Hs.208558 EST duster (not in UniGene) 8.8 
323570 AL038623 Hs.208752 ESTs; Weakly similar to llil ALU SUBFA 5 
323604 AI751438 Hs.41271 ESTs; Weakly similar to till ALU SUBFA 6.5 

45 323685 AA344205 Hs,289088 EST duster (not in UniGene) 7.1 

323753 AA327102 Hs.70266 EST duster (not in UniGene) 6.1 

323817 AA410943 EST duster (not in UniGene) 16.8 

323845 AI684674 Hs.41127 ESTs; Weakly similar to wadaw [Djneian 10.1 

323930 AA570698 Hs.8173 ESTs 6.4 

50 323997 AA844907 Hs.274454 EST duster (not in UniGene) 8 

324047 AA378201 Hs.271340 EST duster (not in UniGene) 6.3 

324261 AL044891 Hs.269350 EST duster (not in UniGene) 50.1 

324302 AA543008 Hs.292471 ESTs; Weakly similar to HII ALU SUBFA 5.7 

324338 AL138357 Hs.145078 ESTs 9.5 

55 324344 AW502000 Hs.46677 EST duster (not in UniGene) 4.4 

324432 AA464510 Hs.152812 EST duster (not in UniGene) 16.7 

324495 AW501411 Hs.122489 ESTs; Weakly similar to HII ALU CLASS 5,5 

324497 AW152624 Hs.136340 ESTs 5.4 

324598 AA502659 Hs.163986 ESTs 8.8 

60 324603 AW016378 Hs.292934 ESTs 23.1 

324620 AA448021 Hs.94109 EST duster (not in UniGene) 2\2 

324727 AI610425 Hs.19597 ESTs 5 

324774 A1031771 Hs.132586 ESTs 5 

324783 AA640770 Hs.200994 EST duster (not in UniGene) 4.1 

65 324824 AI826999 Hs.224624 ESTs 6.3 

324826 AA704806 Hs.143842 ESTs 11.7 

324902 031323 Hs.271492 ESTs 4.8 
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324961 AA613792 




EST duster (not in UniGene) 


13.3 


324987 T08882 


Hs.172634 


ESTs 


19.6 


324988 T06997 


Hs.121028 


EST duster (not tn UniGene) 


24.5 


325146 AI064690 


Hs.171176 


ESTs 


4.6 


325622 




CH.14Jis gt]5867000 


5.2 


326213 




CR17Jisgi|5867224 


8.1 


326474 




CH.19_hs gj5867405 


12.7 


326816 




CH^OJts gq6552458 


9.4 


326617 




CK20_hsgiJ6552458 


11.7 


327110 




CR21Jis gq6117842 


14.7 


327196 




CR01„hsglj5867446 


5.1 


327283 




CH.01_hs gi(5867478 


4.3 


327313 




CH.01 hs gi|5867501 


4.8 


327450 




CH.0Lhsgi|5867766 


4.1 


328059 




CH.06_hsgi|6117819 


6.2 


328304 




CH.07_hsg(|6004478 


5,4 


328492 




CH.07_hsgij5868455 


7 


328857 




CR07_hsgiI6381927 


5.2 


329367 




CRXJwgi|5868842 


7.6 


329373 




CHXJw gi]6682537 


12 


329655 




CH,14_p2gi[6448516 


4 


329899 




CH.15j)2gil6563505 


4 


329960 




CH.16_p2gi|5091594 


7.6 


330084 




CR19j)2gt|6015302 


4 


330384 M23263 




androgen receptor (dihydralestosterone re 


5.8 


330385 AA449749 




ESTs; Highly similar to secreted apoptosl 


10.2 


330387 H14624 




ESTs; Highly slmBar to secreted apoptosl 


4.4 


330388 X03363 




HER2 receptor tyrosine kinase (c-erbB-2; 


17.7 


330409 D50692 


Hs.78221 


c-myc binding protein 


10.1 


330460 TIGRUT544 


HsJ3946 


Endo 


330486 M13755 


Hs.833 


interferon-sflmulated protein; 15 kOa 


67 


330494 M29696 


Hs.237868 


interieukin 7 receptor 


6 


330500 M34423 


Hs.79222 


galactosidase; beta 1 


13.1 


330510 M75099 


Hs.227729 


FK506-bindir>g protein 2 (13JcD) 


29 


330513 M81057 


Hs.180884 


carboxypeptidase 81 (tissue) 


38.5 


330541 U22970 


Hs.265827 


multiple UniGene matches 


7.4 


330542 U23942 


Hs.226213 


cytochrome P450; 51 (lanosterol 14-alpha 


15 


330547 U32989 


Hs.183671 


tryptophan 2;3-dioxygenase 


11 


330551 U39840 


Hs.299867 


hepatocyte nuclear (actor 3; alpha 


6.5 


330562 U49082 


Hs.76460 


transporter protein 


7.7 


330573 U62800 


Hs.83393 


cystatinE/M 


4 


330673 D57823 


Hs.321403 


Sec23 (S. cerevislae) homotog A 


10.5 


330711 M164687 


Hs.177576 


mannosyl (alpha-1;3-)^lycoprotein beta-1 


24.3 


330614 AA015730 


Hs.265398 


ESTs; WeaWy similar to transformations 


44.1 


330850 AA075298 


Hs.322710 


ESTs 


4.4 


330874 AA127474 


Hs.191157 


ESTs; WeaWy similar to 111! ALU SUBFA 


8.1 


330884 M133457 


Hs.102548 


ESTs 


5,2 


330912 AA195936 


Hs.82719 


general transcription factor IIA; 1 (37kD a 


5 


330924 AA232136 


Hs.159737 


Homo sapiens mRNA; cONA DKFZp434 


9.1 


330997 H55762 


Hs.9302 


ESTs 


7.6 


331014 H98597 


Hs.30340 


ESTs 


13.5 


331024 N32919 


Hs.27931 


ESTs 


9.1 


331046 N66563 


Hs.191358 


ESTs 


10.5 


331135 R61398 


Hs.4197 


ESTs 


7.4 


331145 R72427 


Hs.129873 


ESTs; Weakly similar to CYTOCHROME 


41.9 


331148 R73816 


Hs.17385 


ESTs 


4.7 


331222 T98531 


Hs.173904 


ESTs 


4.1 


331230 W69B07 


Hs.16537 


hypothetical protein; similar to (U06944) 


4.9 


331306 AA252079 


H5.63931 


dachshund (DrosophBa) homotog 


15.1 


331327 AA281076 


Hs.109221 


ESTs 


4.8 


331337 AA287662 


Hs.50495 


ESTs 


7.6 


331341 AA303125 


Hs.23240 


ESTs; Weakly similar to Iffl ALU SUBFA 


13 


331344 AA357927 


Hs.1 26550 


ESTs 


114 


331362 AA4 17956 


Hs.40782 


ESTs 


6.5 


331363 AA421562 


Hs.91011 


anterior gradient 2 (Xenepus laevfe) homo 


28.2 


331376 AA443802 


Hs.41007 


ESTs; WeaWy similar to cONA EST yk47 


15.1 


331384 AA456001 


Hs.93847 


ESTs. 


7.9 
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331478 N26608 Hs.40639 . ESTs 7 

331526 N49957 Hs.46624 ESTs 19,8 

331533 N51517 Hs.47282 ESTs 6.5 

331681 W85712 Hs.119571 collagen; type (II; alpha 1 (Ehters-Oanfos 13.6 

5 331686 W88502 Hs.182258 ESTs 9.9 

331750 AA284372 Hs,111471 ESTs 5.6 

331751 AA284840 Hs.143818 ESTs 5.8 
331760 AA292721 Hs.154434 ESTs; Weakly similar to unknown [H.sap 7.4 
331763 AA312861 Hs.96704 ESTs 7.8 

10 331825 AA411144 Hs.292882 ESTs 15.2 

331890 AA432166 Hs.3577 succinate dehydrogenase complex; subunl 24.3 

331952 AA454756 Hs.97837 ESTs 5 

332015 AA487910 Hs.208800 ESTs; WeaWy similar to Iff! ALU CLASS 10.5 

332043 AA490831 Hs.125056 ESTs 11.4 

15 332060 AA504779 Hs.191402 ESTs 13.6 

332071 AA598594 Hs.205293 ESTs 9.1 

332093 AA608794 Hs.1 12592 ESTs 8.8 

332139 AA620669 Hs.112879 EST 9 

332219 N22508 Hs.139315 ESTs 7.1 

20 332225 N33213 Hs.100425 ESTs 125 

332246 N57927 Hs.120777 ESTs; Weakly similar to RNA POLYME 15.6 

332247 N58172 Hs.109370 ESTs 16.9 
332260 N70088 Hs.138467 ESTs 4 
332269 N91279 Hs.109654 ESTs; Moderately similar to outer membr 8.2 

25 332336 T96130 Hs.137551 ESTs 7.7 

332340 W15495 Hs.129781 chromosome 21 open reading frame 5 14.1 

332347 W60326 Hs.288684 ESTs 4.4 

332362 W93640 Hs.4779 ESTs; Moderately similar to similar to AO 16.9 

332467 AA489630 Hs.1 19004 KIM0665 gene product 4.8 

30 332499 M12036 Hs.323910 Human tyrosine khase-type receptor (HE 10.4 

332513 AA018182 Hs.154424 detodinase; lodothyronine; type H 5.8 

332526 AA281753 Hs.77515 Inositol 1;4;5-tfphosphate receptor; type 19 

332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHAN 15.3 

332565 AA234895 Hs.25272 E1A bhd'ng protein p300 12.3 

35 332607 R41791 Hs.36566 LIM domain kinase 1 * 11.1 

332640 AA417152 Hs.5101 protein regulator of cytokinesis 1 18.2 

332694 AA262768 Hs.243901 KIAA1067 protein 15.2 

332702 H93968 Hs.75725 transgeBn2 4.7 

332705 T59161 Hs.76293 thymosin; beta 10 5.5 

40 332749 AA479968 Hs.B8251 arylsuIfataseA 9.8 

332927 CH22^GENES.38J 17.7 

332929 CH22J r GENES.38J 4.7 

332930 CH2ZJGEHESMJ 7.4 
332955 CH22-FGENES.48J2 5.4 

45 332958 CHZLFGENES^JS 17.8 

332961 CH22„FGENES.48_18 10.6 

332983 CH2^FGENES.54J 4.3 

333009 CH22J=GENES.61J 52 

333010 CH22J r GENES.61 w 2 8.1 
50 333013 CH22J=GENES.61_5 8.5 

333108 O^GENES^M 5.6 

333139 CH22J r GENES.83J6 6.3 

333254 CH22J=GENES.118_2 6.8 

333305 CH22_FGENES.137_2 11.4 

55 333343 CH25LFGENES.139.JI2 5.1 

333388 'CH22.FGENES.144J 12.7 

333456 CH22J r GENES.157J 4.2 

333459 CH22.FGENES.157J 7.6 

333517 CH22.FGENES.173J2 8.2 

60 333585 CH22J r GENES.203jl 5 

333679 CH22J r GENES.247_6 4.3 

333743 CH22.fGENES.264J 13.4 

333758 CH^GENES^SJ 4 

333767 CH22J r GENES.27L6 5.6 

65 333768 CH22LFGENES.271J 122 

333769 CH^GENES^U 48.3 

333795 CH2^/GENES275J 6.1 
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333796 


CH22J=GENES.Z75J 


6.6 


333892 


CH22J : GENES.292J4 


4.4 


333904 


CH22J : GENES.294_2 


6.5 


333905 


CH22_FGENE&294_3 


9.3 


333921 


CH22_FGENES^96J2 


9.6 


333968 


CH2£_FGENES.3Q7_4 


155 


334102 


CH22_FGENES.327J0 


7.1 


334222 


CH22_FGENES.360_3 


6.7 


334223 


CH22_FGENES.360_4 


33.5 


334264 


CH22J : GENES.367_15 


18.5 


334343 


CH2ZJGENES.375_25 


6.1 


334360 


CH22J=GENES.378J 


6.1 


334784 


CH22^FGENES.432_9 


4.8 


334789 


CH22J : GENES.432J4 


5.1 


334794 


CH22J=GENES.434J2 


7 


334889 


CH22_FGENES.452_3 


12.4 


335004 


CH22J : GENES.472_8 


7.9 


335115 


CH22J : GENES.496J2 


18.8 


335287 


CH22J=GENES,526J1 


4.5 


335342 


CH22.FGENES.536J 


5.3 


335491 


CH22_FGENES.570J23 


24 


335495 


CH2a.FGENES.570.28 


7 


335498 


CH22_FGENEa571_7 


12.2 


335544 


CH2£J : GENES.576_5 


8.4 


335610 


CH22_FGENES.583_4 


125 


335653 


CH22_FGENES.590_4 


6.7 


335682 


CH22J=GENES.595J2 


12.1 


335687 


CH22_FGENES.596_2 


13.9 


335755 


CH22_FGENES.604_4 


115 


335782 


CH22_FGENES.6Q9J 


175 


335791 


CH22_FGENES.611_7 


27.3 


335809 


CH22_FGENES.617..6 


19.2 


335822 


CH22 - FGENES.619_7 


19.1 


335823 


CH22_FGENES.619_8 


4.5 


335824 


CH22J : GENES.619J1 


40.2 


335825 


CH22_FGENES.619J2 


34.3 


335895 


CH22_FGENES.635_3 


10.2 


335917 


CH22_FGENES.636J3 


6 


335920 


CH22_FGENES.636J6 


8.8 


336035 


CH22J r GENES.678_6 


5.9 


336042 


CH22_FGENES.679_4 


5.8 


336093 


CH22_FGENES.691_2 


11.6 


336098 


CH22J : GENES.691_5 


7.6 


336150 


CH22_FGENES706_6 


6.3 


336152 


CH22_FGENES.706_9 


10.5 


336416 


CH2?J r GENES.823_38 


5 


336444 


CH22_FGENES.827_10 


4.8 


336449 


CH22L.FGENE S.829_6 


13.6 


336471 


CH2i.FGENES.829.30 


6.9 


336512 


CH22_FGENES.834_7 


21.4 


336558 


CH2a.FGENES.84U 


8.2 


336560 


CH2ZJGENES.842J 


9 


336676 


CH22_FGENES.43-4 


9.4 


336959 


CH22.FGENES.367-13 


19 


337968 


CH22_EM:AC005500.GENSCAN.103-2 


13.4 


338008 


CH2Z_EM:AC005500.GENSCAN.127-9 


15.2 


338057 


CH2a.EfytAC005500.GENSCAN.160-1 


13.9 


338410 


CH22„EM^C005500.GENSCAN.341-6 


8 


338451 


CH2a.EMAC005500.GENSCAN.35^3 


11.6 


338588 


CH22.EMAC005500.GENSCAN.432-1 


10.3 


338665 


CH2^EfvtAC005500.GENSCAN.464>2 


4.8 


338689 


CH22_E^tAC005500.GENSCAN.475^3 


6.7 


338832 


CH22JXI246D7.GENSCAN.64 


4.8 


338980 


CH22_DA59H18.GENSCAN.24 


5.1 


339352 


CH2i_BA354M2.GENSCAN.29-7 


6.9 


339373 


CH2i.BA232E17.GENSCAN.1-29 


4.3 
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TABLE 13A 

Table 13 A shows the accession numbers for those pkeys lacking unigenelD's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681 1 AA602964 AA609200 
103207 30635.4 X72790 
103349 11052.-2 X89059 

110856 19346J4 AA992380 N33063N21418H79958 R21911 H79957 
113248 328626J T63857AW971220AA493469T63699 

123169 44573.2 AI950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 A1820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 A1560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 A1872193 AI763273 AW1 73586 AW150329 AJ653832 
AI762688 AA988777 AA488892 A1356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 AA251669 AA251874 
AI819225 AW205862 A1683338 A1858509 AWZ769Q5 AI633006 AA972584 M908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H50052 792487 A1022058 AA780419 AA551005 W80701 AW613456 AI373032 A1564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 

123533 genbankJ\A608751 AA608751 

116480 genbanK.C14088 C14088 

132225 genbank_AA12898Q AA128980 

125154 genbank.W38419 W38419 ' 
118475 genbanK_N65845 N66845 

102919 25180.2 M21 191 AUJ35748 AA021266 AA323126 M18051 5 AI613029 D28356 NM.000034 M1 1560 AW401425 AW248248 AA012956 
AA323294 W04965 H38759 AA206622 AA580747 AI541172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 
AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331367 
AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 AA313917 AA018963 AA001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 M348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 MQ85425 AA481708 
AA313637 AL039229 H84490 H86153 F00656 AA326668 AA347304 R65890 H41949 AA339309 AW402002 AW404854 
AA192582 AA112802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H85911 AA310414 H93436 N87014 
AA001 186 H83640 AA41 1328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 AI366131 H14328 M197161 
AA379497 M31 1816 AA017206 AA001137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 AA374921 
AW380419 AA345864 AA318058 AA371711 AA363255 AA057094 R88057 AA394045 AW362741 AA479579 AW362789 
AW362775 AA223624 AW362809 AW362746 AW362753 AW380412 M057104 AA295271 X06352 AW362771 X12447 R59553 
R87925 AA292864 R83920 F01 120 AA496740 AA852551 AL039209 AA093339 M333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 AA330784 AI908970 N89223 AW361938 AI940053 AA134323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 A1074079 F291 18 AA852940 F35696 AA345963 
AA079578 AA113785 C02989 AA095945 C03145 C05199 AA346024 M190506 AW361659 AI909845 AW374374 AW374382 
AW374401 AW374373 AW374370 AI909831 AW374367 AA353658 F01041 C02843 AA375948 AW374414 AA213946 A1525039 
H13744 R31007 AA112044 AA134404 H47935 AW177018 AA429768 AA336873 AA112875 H46393 AA191267 D59131 
AW406037 AA055244 AA341880 AA179024 AA308537 AW406985 AW327311 H30301 AA300705 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21 190 AA410818 AA250940 AI354547 AA317422 AA250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 F00308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 
AW376830 AA383446 F01 259 AW081388 T94036 AA379643 H43842 A1524063 AA621727 AA379099 AA3714 17 R6681 1 
H44129 N84794 F01135 AA477852 AA293062 AW361595 H27194 A1831650 H43253 H24797 AI564680 AA380090 W20057 
AI921586 AI192549 AW090808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 H96000 C03160 F00946 
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C03986 AJ318091 A1860172 AA582179 AI633388 AA557193 R68075 F24105 AW518239 W56622 AI625219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA159956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 A1571598 A1128394 W73707 AI423575 AA583809 AA657988 AI950837 AA169782 AA600009 AI885540 AA771884 
5 AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 AI905695 AA4801 15 AA574051 

AW89185 AA773167 AA331375 AA001437 AA194324 AA194300 AA558632 AI038538 AA411329 AA781570 A1833176 
AA935520 AW074197 AA583063 AW073099 AW001198 AA948025 AA587857 AA191540 AI460085 AA193244 AI538037 
AA515572 AA758587 AI14931 1 AA508610 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 D25663 
AA781985 AA284536 AI819422 H16Q40 H27531 AM56564 AA845555 AI423596 AA012908 AA889439 AA71631 1 AA968868 

1 0 AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594803 F00827 AI342442 AI003519 AKJ02503 AI347597 

AJ040946 AA197162 AA987883 AA292865 AW001944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 A1978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 A1124097 AB80086 AA477513 A1758834 AI690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25Q54 AI569314 AA961665 AI922050 AI759000 AA555236 AA514432 

1 5 AA293474 AA001129 AA826789 AA641390 AA 134405 F35585 AA477416 AW193359 A1361315 AA284988 F36340 AI361322 

F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA 190992 AA577139 AA865535 AA134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
A1830942 AA1 13195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857063 AA017259 

20 AA179789 AA088908 H43704 AA194320 F35950 A1880127 F20441 F32878 AA962483 H39094 H56489 H44621 F19390 

AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA198755 AA086369 AA079530 H28108 AA243301 
AI025737 AA101239 AA088887 A1214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AI811549 AA665180 AA708200 F01124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996 W28215 F21911 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 

25 AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F2841 1 

H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843 f 29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AJ926215 AA464185 F18217 N26193 AL043256 T41197 F33055 F00386 F29500 F34191 F33297 
AI937207 F22724 F15909 F26232 F18889 AA3 18627 F29085 AA872 104 F17509 F23373 F15660 F17552 F17412 F16863 

30 F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 

F34230 F33258 F20860 F17998 AJ695701 
116600 genbank_N69222 N69222 
118952 genbanK.N92966 N92966 
120873 genbankJ\A358015 AA358015 

35 113702 genbankJT97307 T97307 

129982 221J267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF082140 101278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 

40 AF062177 AF062222 AF062115Z47240AF062263 AF062261 AF062223 AF062211 Z47238AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01Z76 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 108083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12096 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00505 

45 X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF 174058 X63080 

083676 AB021529 Z18318 U00488 L01412 XB1746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U0O510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 

50 Z96957 AF021986 Z92898 Z96956 134154 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 

AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF1 03282 U00509 
AF062298 AF062289 AF062307 AW408326 L33Q37 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AtlQ3184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 104323 AF103321 
AF103335 104342 U03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ0Q7327 AF103115 

55 AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF058670 AF068S71 AF068664 

AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514D78345 T29140 J00231 NMJ02179 AW405146AA301091 X04646 H64660AW402990AW406534 T93007 
A1857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA715266 AW404328 A1831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 M715498 AA295885 727613 T981 13 

60 115763 genbanKuAA421560 AA421560 
124357 genbank^N22401 . N22401 
108733 504187 J AA121022 AA126422 
101544 enlrez_M31169 M31169 
124447 genbank_N48000 N48000 

65 124677 genbanKJ*01073 R01073 
124777 genbanK_R41933 R41933 
119302 genbanlLT25725 T25725 
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103680 entre2_Z93784Z93784 
135029 H58818_at H58818 
112253 genbankJR51818 R51818 
Ptey CAT number Accession 
5 322175 46877J AF085975 H53458 H53459 

323011 139750J AA580288 AA315655AA1 33031 AA377748 
322975 1510563J C16391C16413 

317210 211994J AW881145M490718 M85637AA304575 T06067AA331991 

323817 233566 J AM10943 AW948953 AA334202 AA332882 
10 309583 1046029.-2 AW170035 

324961 376239J AA613792AW182329 T05304AW858385 

303502 325188 J BE174240AA488528 AL042253 

320133 447553 J BE151746 BE336853 063271 T94955 AA774994 

311935 174129J AA216387T63548AA228676 
15 321050 502195J C05928 AW393497 

319977 345248 J AA534222AA632632 T81234 

312772 4380_7 AW962299 AA310349 AW982294 H63791 H63751 

321354 116028_-2 AA078493 

336512 CH22.3941FG_834JJLINK.DJ 
20 336558 CH2^3992FGJ42JJJNKJ3J 

336560 CH22J994FGJ42L5JJNKJDJ 

329367 cj_hs 

329373 cjLhs 

336676 CH22.4154FGJJ3 4 
25 338008 CH22_6490FG_UNK_EM:ACOO 

338057 CH22_6558FG_jJNK_EM4C00 

329655 c14j>2 

336959 CH22_4764FG_367_13_ 

329899 c15j>2 
30 329960 c16j>2 

338410 CH2^7067FG_JJNK_EM^COO 

338451 CH22J124FG_JJNK W EM*C00 

338588 CH22_7331FG_UNK_EI^AC00 

338665 CH2*J438FG_JJ^EMAC00 
35 338689 CH22_7464FG_UMK_EM^C00 

308677 AI761173 

33B632 CH22_7678FG_UN^.DJ246D7 
338980 CH22_7859FG_JJhK.OA59H18 
333009 CH22J33FG_61J_UNK_EMA 
40 333010 CH22J234FG 61 2JLINK_EfvtA 
333013 CH22_237FG_61J_UNKJEMA 
308981 AI873242 
308995 AI880172 

333108 CH22_.336FGJ r 9_14_LINK_EM: 
45 333139 CH22_368FG_83_16_UNK_EM: 

333254 CH22„495FG_1 18JLLINK..EM: 

333305 CH22»550FG_137_2_UNK_EM: 

333343 CH22_589FGJ39J2JJNK_EM 

333388 CH22_634FGJ44J_UNK_EM: 
50 326213 c17Jis 

333456 CH22_706FG_157_5_LINK_EM: 

333459 CH22_709FG_157_8 UNK-EM: 

333517 CH2^773FG_173.UINK_EM: 

333585 CH2*j346FG_203_4JUNKJEM: 
55 333679 CH22_341FG_247_6JUNK_EM: 

326474 c19Jis 

333743 CH22J0Q9FGJ64JJJNK.EM 
333758 CH22_1024FG_268J_UNH_EM 
333767 CH22_1034FG_271_6_LINieEM 
60 333768 CH22_1035FG_271J r JJNK_EM 
333769 CH22L1036FGJ271_3J.INK_EM 

333795 CH22_1063FG_275JJJNK_EM 

333796 CH22J065FG_275JUJNK_EM 
335004 CH22_2326FG_472_8JJNK_EM 

65 333892 CH22J163FG 292J4JUNKJ5 
335115 CH22^2447FG_496^.DnICEM 
333904 CH22_1176FG_294_2 UNK.EM 
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333905 CH22J 177FG J94.3 JJNK.EM 
333921 CH22.1194FG.296J2JJNK.E 
333968 CH2i.1245FG.307JJJNK.EM 
328059 CjjB 
5 335287 CH22_2629FG_526_11JJNK_E 

326816 c20_hs 

326817 c20_hs 

335342 CH22_2689FG_536_1_LINK_EM 
335491 CH22J843FG_570_23JLINK£ 
10 335495 CH22J847FG_570_28JJNK_E 
335498 CH22J850FG_571.7JJNK.EM 
328304 c_7_hs 
305453 AA738110 

335544 CH22_2899FG.576_5_LINK.EM 
15 335610 CH22J969FG_583J.LINK.EM 
335653 CH22_3013FG.590.4 LINK.EM 
335682 CH22.3043FG.595_2_LINK.EM 
335687 CH22.3048FG.596J.LINK.EM 
328492 c_7_hs 
20 335755 CH22.3122FG.604_4.LINK.EM 
335782 CH22_3151FG.609_4_UNK.EM 
335791 CH22_3160FGJ11.7JJNK.EM 
335809 CH22.3181FG.617.6JJNK.EM 
335822 CH22_3195FG_619.7_LINK.EM 
25 335823 CH22_3196FG_619.8_LINK.EM 

335824 CH22_3197FG_619_11_LINK.E 

335825 CH22_3198FG_619_12_UNK.E 
335895 CH22_3272FG_635_3_LINK_EM 
335917 CH22.3294FG.636.13.LINK.E 

30 335920 CH22J297FG_636J6.UNK_E 

305898 AA872838 

305913 AA876109 

305950 AA884479 

328857 c_7_hs 
35 330084 c19j)2 

337968 CH22_6419FG_.LINK.EM:AC00 

309177 AI951118 

309198 AI955915 

309226 AI969897 
40 339352 CH22.8317FG__UNK_BA354I1 

309279 A1990102 

339373 CH22_8348FG__LINK.BA232E1 
325622 c14_hs 

334102 CH22_1380FG.327.6O_UNKJE 
45 332927 CH22_148FG_38_1.UNK.C20H 

332929 CH22J50FG_38.3.UNK_C20H 

332930 CH22_151FG_38.4_UNK.C20H 
332955 CH22.179FG.48 12JJNK.EM: 
332958 CH22.182FG 48 15JJNK.EM: 

50 332961 CH22J85FG_48_18.UNK.EM: 
332983 CH22_207FG_54_5JJNK_EMA 

334222 CH22_1506FG.360_3JUNK.EM 

334223 CH22.1507FG.360_4JJNK.EM 
334264 CH22_1551FG_367_15_UNK_E 

55 327110 c21_hs 

334343 CH22.1636FG_375.25 UNKJE 

334360 CH22_1654FG_378_5_UNK_EM 

327196 c_1_hs 

327283 Q.1 hs 
60 327313 cjjts 

304465 AA421948 

304507 AA456426 

327450 C_2> 

304591 AA505702 
65 304601 AA507875 

304659 AA533185 

334784 CH22_2096FG_432_9_UNK.EM 
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334789 CH22J101FGJ32J4 UNK.E 
334794 CH22J106FGJ34JLUNK3I 
336035 CH22_3420FG 678 6 UNKJDJ 
336042 CH22_3427FG_679JJJNKJ)J 
5 336093 CH2£J481FG__691_2JJNKJ)J 
336096 CH22L_3484FG_691_5JJNKJ)J 
334889 CH22_2206FG_452_3J_INK^EM 
336150 CH2^3540FG_706_6JJNK_DA 
336152 CH22J543FG_706_9_UNKJ>A 
10 336416 CH22J833FGJ23_38JJNK_0 
336444 CH22_3864FG_827_10JLINK_D 
336449 CH22_3870FG_829_6_UNHLDJ 
336471 CH22J894FG_829_30J.INKJ> 
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TABLE 13B 

Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 13B. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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Pkey. Unique number corresponding to an Eos probeset 

Ref: Sequence source The 7 digit numbers in this column are Genbank Idenfifier (Gl) numbers. "Dunham I. et al" refers to the publication 

entitled The DNA sequence of human chromosome 22/ Dunham I. et aL, Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand NLposition 



332955 
332958 
332961 
333139 
333254 
333305 
333388 
333517 
333585 
333679 
333767 
333768 
333769 
333795 
333796 
333892 
333921 
333968 
334102 
334264 
334343 
334794 
334889 
335287 
335491 
335495 
335498 
335653 
335687 
335809 
335822 
335823 
335824 
335825 
336035 
336093 
336098 
336444 
336959 
338008 
338057 
338410 
338588 
338665 
338832 
338980 
339352 



Dunham, 1. etal 
Dunham, I. etal. 
Dunham, I. etai. 
Dunham. 1. etal. 
Dunham, I. etal 
Dunham, I. etal 
Dunham. L etal. 
Dunham, i. etal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal 
Dunham, I. etai. 
Dunham, I. etal. 
Dunham, L etal 
Dunham, I. etal 
Dunham, I. etal 
Dunham, L etal 
Dunham, I etal. 
Dunham. L etal 
Dunham, I etal. 
Dunham, I. etal 
Dunham. I. etal 
Dunham, L etal 
Dunham, I. etal. 
Dunham, I etal. 
Dunham, L etal 
Dunham, I. etal 
Dunham, L etal 
Dunham, I. etal 
Dunham, L etal 
Dunham, I etal 
Dunham, L etal 
Dunham, L etal 
Dunham, I etal 
Dunham, I. etal 
Dunham, I. etal 
Dunham, I. etal 
Dunham. I. etal 
Dunham, I. etal 
Dunham, I. etal 
Dunham. I. etal 
Dunham,!. etal Plus 
Ounham, L etal Plus 
Dunham. I. etal Pius 
Dunham. 1. etaL Plus 



Plus 
Plus 
Plus 
Plus 
Plus 
Pius 
Pius 
Pius 
Pius 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Pius 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Pius 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Plus 
Pius 



2508895-2506992 

2516164-2516310 

2521424-2521555 

3369495-3369571 

252142«521555 

4630388-4630645 

49137494913805 

5570729-5570925 

623477W234894 

7068795-7068896 

7694407-7694623 

7695440-7695697 

7696625-7696707 

7807688-7807795 

7808253-7808319 

8156825-8157001 

83803254380441 

8681004-8681241 

9995140-9996373 

13234447-13234544 

13655828-13656307 

16374312-16374458 

19286024-19286515 

22299047-22299299 

24128651-24128827 

24140688-24140872 

24172082-24172161 

25329710-25329802 

25445952-25446064 

26310772-0310909 

26364087-26364196 

26365925-26366004 

26376860-26376942 

26378175-26378268 

29016748-29017410 

29556922-29557002 

29578878-29579047 

34190585-34190718 

13233040-13233126 

7697068-7697236 

8526397-8526522 

19292807-19292916 

22896767-22896920 

24472654-24472853 

27775128-27775290 

29896789-29896874 

33544784-33545121 
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332929 Dunham, I. etaL Minus 

332930 Dunham, L etaL Minus 
332983 Dunham,!. etaL Minus 
333009 Dunham, 1. etaL Minus 

5 333010 Dunham, Let at Minus 
333013 Dunham, L etaL Minus 
333108 Dunham, I. etaL Minus 
333343 Dunham, I. etaL Minus 
333456 Dunham, I. etaL Minus 
10 333459 Dunham, L etaL Minus 
333743 Dunham,!. etaL Minus 
333758 Dunham, 1. etaL Minus 

333904 Dunham, I. etaL Minus 

333905 Dunham,!. etaL Minus 
15 334222 Dunham, Letal. Minus 

334223 Dunham, Letal. Minus 
334360 Dunham, Letal. Minus 
334784 Dunham. Letal. Minus 
334789 Dunham, Letal. Minus 

20 335004 Dunham, Letal. Minus 
335115 Dunham, Letal. Minus 
335342 Dunham, Letal. Minus 
335544 Dunham,!. etaL Minus 
335610 Dunham,!. etaL Minus 

25 335682 Dunham, I. etaL Minus 
335755 Dunham, I. etaL Minus 
335782 Dunham, Letal. Minus 
335791 Dunham, I. etal. Minus 
335895 Dunham.!. etaL Minus 

30 335917 Dunham, I. etal. Minus 
335920 Dunham,!. etaL Minus 
336042 Dunham, I. etal. Minus 
336150 Dunham, Letal. Minus 
336152 Dunham, Letal. Minus 

35 336416 Dunham, I. etaL Minus 
336449 Dunham, I. etaL Minus 
336471 Dunham. Letal. Minus 
336512 Dunham, L etaL Minus 
336558 Dunham, I. etal. Minus 

40 336560 Dunham, Letal. Minus 
336676 Dunham, Letal. Minus 
337968 Dunham, Letal. Minus 
338451 Dunham, Letal. Minus 
338689 Dunham, I. etal. Minus 

45 339373 Dunham, I. etal. Minus 
325622 5867000 Plus 
329655 6448516 Minus 
329899 6563505 Minus 
329960 5091594 Minus 

50 326213 5867224 Minus 
326474 5867405 Plus 
330084 6015302 Minus 

326816 6552458 Plus 

326817 6552458 Plus 
55 327110 6117842 Plus 

327196 5867446 Plus 
327283 5867478 Minus 
327313 5867501 Minus 
327450 5867766 Minus 

60 328059 6117819 Plus 
328492 5868455 Minus 
328304 6004478 Minus 
328857 6381927 Minus 
329367 5868842 Minus 

65 329373 6682537 Minus 



2020758-2020664 

2022665-2022497 

2631933-2631797 

2766043-2765856 

2766207-2766119 

2772278-2772039 

3240494-3240389 

46928864692753 

2631933-2631797 

5144546-5144344 

7573218-7573060 

7666413-7666091 

82173744217261 

82177964217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305996 

20581911-20581794 

21388250-21388146 

22597448-22597284 

24650505-24650403 

25068943-25068841 

25421215-25421093 

25763806-25763747 

25908578-25908440 

25948563-25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

30150423^30150256 

3015605340155870 

34047408-34047311 

34204707-34204577 

34215091-34214978 

3427837344278275 

3437582544375698 

3437681444376596 

2022565-2022497 

7095797-7095680 

20174286-20174193 

24893073-24892972 

33860127-33860047 

69994-70075 

35565-35843 

111058-111783 

1031-1162 

60751-60927 

16995-18101 

5701949337 

198354-198436 

199909-200001 



180921-181333 

567-962 

8973449838 

47928-48076 

3705247204 

4609446241 

38844952 

8055741051 

8720147587 

3895049301 
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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
cancer cells. 



Ptey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnigenelD: Unlgene number 

Unlgene Title: Unlgene gene life 

R1: Ratio of tumor to normal breast tissue 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn 


UniGenelD 


Unlgene Title 


R1 


100038 M97935 




AFFX control: STAT1 


16.7 


100114 D00596 


Hs.82962 


Ihymidytate synthetase 


15.9 


100975 J02923 


Hs.76506 


lymphocyte cytosoDc protein 1 (L-plastin) 


30.1 


101031 J05070 


Hs.151738 


matrix metaOoprotelnase 9 (gelaGnase B; 9 


37.2 


101104 107615 


Hs. 169266 


Human neuropeptide Y receptor Y1 (NPYY 


18.3 


101143 L12723 


Hs.90093 


heat shock 70KO protein 4 


17:4 


101332 L47276 


Hs. 156346 


Homo sapiens (ceD One HL-6) alpha topots 


18.9 


101378 M13755 


Hs.833 


interferon-stimuiated protein; 15 kDa 


18.1 


101809 M86849 


Hs.323733 


Homo sapiens connexin 26 (GJB2) rnRNA 


22.5 


102618 U65932 


Hs.81071 


extracellular matrix protein 1 


23.2 


102721 U79241 


Hs.1 18666 


Human clone 23759 mRNA; partial cds 


15 


102817 U90904 


Hs.83724 


Human clone 23773 mRNA sequence 


15.2 


102907 X06985 


Hs.202833 


heme oxygenase (decycEng) 1 


22.7 


102985 X17644 


Hs.2707 


G1 toS phase transition 1 


20.6 


103060 X57766 


Hs.155324 


matrix metaDoprotelnase 11 (stromefysin 3 


17.8 


103180 X69433 


Hs.5337 


Isocitrate dehydrogenase 2 (NADP+); mito 


18.9 


103206 X72755 


Hs.77367 


monokine induced by gamma interferon 


15.1 


103821 AA157623 


Hs.198793 


K1AA0750 gene product 


23.3 


104115 AA428090 


Hs.26102 


ESTs 


28.7 


104667 AA007234 


Hs.30098 


ESTs 


16.6 


105186 AA191512 


Hs.26005 


Homo sapiens mRNA; cDNA DKFZp564G 


19.3 


106103 M421104 


Hs.12094 


ESTs 


15.4 


107151 AA621169 


Hs.8687 


ESTs 


19 


109415 AA227219 


Hs.1 10826 


trinucleotide repeat containing 9 


20.1 


110189 H20543 


Hs.6278 


DKFZP586B1621 protein 


16.6 


110561 H59617 


Hs.5199 


ESTs; Weakly similar to UBIQUmN-CON 


19.5 


110734 H98714 


Hs.24131 


ESTs 


30.2 


110915 N46252 


Hs.29724 


ESTs 


2Z2 


111179 N67239 


Hs.10760 


ESTs 


37 


111357 N91023 


Hs.87128 


ESTs 


15 


112134 R46025 


HsJ413 


ESTs 


17.4 


113970 W86748 


Hs.8109 


ESTs 


15 


114124 238595 


Hs.1 25019 


ESTs; Highly similar to K1AA0886 protein 


22 


114292 Z40715 


Hs.184641 


detta-6 tatty acid desaturase 


19.4 


114901 AA236276 


Hs.196437 


ESTs; Weakfy similar to R26660 J; partial 


16.9 


114965 AA250737 


Hs.72472 


ESTs 


35.1 


115652 M405098 


Hs.38178 


ESTs 


16.1 


115875 M433943 


Hs.43946 


ESTs; WeaWy similar to Weak similarity t 
microtubule-associated protein tau 


33.5 


116790 H29532 


Hs.101174 


22.2 


116921 H72948 


Hs.821 


blglycan 


20.7 


117412 N26722 


Hs.42645 


ESTs 


18.1 


120241 Z41815 


Hs.65946 


ESTs 


15.6 


120325 M195651 


Hs.104106 


ESTs 


15.2 


121596 AA416740 


Hs.174104 


ESTs 


22.6 


123619 AA609200 




ESTs 


23.1 


124006 060302 


Hs.270016 


ESTs 


20.6 


125852 K09290 


Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B 


25.9 


126160 N90960 


Hs.265398 


ESTs; Weakly similar to trarisformation-rel 
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127677 AA916752 Hs264190 ESTs; Highly similar to MEM3 [Mmuscul 17.3 

128595 U31875 Hs.152677 short-chain alcohol dehydrogenase family m 27.1 

128717 T30617 Hs.104222 Homo sapiens mRNA; cONA DKFZp566L 24.5 

129124 AA234530 Hs.108802 r^thylrnaleimide-sensitive factor 207 

5 129366 H18027 Hs.184697 plexind 182 

130455 X17059 Hs.155956 N«acetyltransferase 1 (arylamine N-acetyit 26.4 

130604 X03635 Hs.1657 estrogen receptor 1 39.9 

130913 W03592 Hs.21198 translocase of outer mitochondrial membra 20.9 

130944 M97935 Hs2U8$ signal transducer and activator of transcript 18,8 

10 131472 AA608962 Hs.27258 calcycfin binding protein 18.1 

131562 U90551 Hs28777 H2A htstone family; member L 18.8 

132180 AA405569 Hs.418 fibroblast activation protein; alpha: seprase 15.4 

132406 F09979 Hs.4774 ESTs 15 

132465 AA047896 Hs.49169 ESTs 15.4 

15 132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose t 26.4 

133294 R79723 Hs.69997 zinc finger protein 238 " 30.4 

133634 U24166 Hs.234279 mlcrotubule-associated protein; RP/EB fam 152 

134374 D62633 Hs.8236 ESTs 152 

134405 J04177 Hs.82772 collagen; type XI; alpha 1 15.3 

20 134470 X54942 Hs.83758 CDC28 protein kinase 2 203 

134495 D63477 Hs.84087 KIAA0143 protein 16.1 

134714 U89922 Hs.890 lymphotoxtn beta (TNF superfamiiy; memb 35,7 

135237 AA454930 Hs.9691 ESTs 19.5 

301884 AA312082 Hs.105445 GDNF family receptor alpha 1 20.7 

25 302276 NMJ)04448Hs323910 EST cluster (not In UniGene) with exon hit 21.6 

302290 AL1 17607 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N 414 

309177 AI951118 EST singleton (not In UniGene) with exon 24.3 

309583 AW170035 EST 64.5 

310438 AW022192 Hs.200197 ESTs 39.1 

30 311166 AI821294 Hs.118599 ESTs 24.1 • 

312153 AA759250 Hs.153028 cytochrome b-561 27.1 

313915 A1969390 Hs.163443 ESTs 27.1 

314506 AA833655 Hs206868 ESTs 27.8 

314558 AI873274 Hs.190721 ESTs 215 

35 314691 AW207206 Hs.136319 ESTs 21.4 

314943 AI476797 Hs.184572 cell division cycle Z G1 to Sand (32 to M 18.4 

315196 AA972756 Hs.44898 ESTs 28.8 

316177 AI908272 Hs293102 EST cluster (not in UniGene) 32.6 

318073 AW167087 Hs.131562 ESTs 15.7 

40 318662 AI285898 Hs294014 ESTs 16.3 

318740 NM_002543Hs.77729 EST duster (not in UniGene) 21.3 

318744 A1793124 Hs.144479 ESTs 35 

319668 NM_002731Hs.87773 EST cluster (not in UniGene) 25.4 

320074 AA321166 Hs.278233 EST cluster (not In UniGene) 16.7 

45 320211 AL039402 Hs.125783 OEME-6 protein 243 

320727 U96044 Hs.181125 EST cluster (not in UniGene) 153 

322818 AW043782 Hs293616 ESTs 21 

322882 AW248508 Hs279727 DiGeome syndrome criilcai region gene 2 153 

324261 AL044891 Hs269350 EST cluster (not in UniGene) 50.1 

50 324432 AA464510 Hs.152812 EST duster (not in UniGene) 16.7 

324603 AW016378 Hs.292934 ESTs 23.1 

324620 AA448021 Hs.94109 EST cluster (not in UniGene) 212 

324988 T06997 Hs.121028 EST cluster (not in UniGene) 24.5 

330388 X03363 HER2 receptor tyrosine kinase (c-erbB-2; E 17.7 

55 330486 M13755 Hs.833 Interferon-stimulated protein; 15 kDa 67 

330814 AA015730 Hs.265398 ESTs; Weakly similar to trartsformatiorwel 44.1 

331145 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME 41.9 

331306 AA252079 Hs.63931 (Jachshund (Drosophila) homolog 15.1 

331890 AA432166 Hs.3577 succinate dehydrogenase complex; subun'rt 24.3 

60 332526 AA281753 Hs.77515 inositol 1;4;5^riphosphaterecepton type 3 19 

332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHANO 153 

332694 AA262768 Hs.243901 KIAA1067 protein 152 

332958 CH22_FGENES,48J5 17.8 

333769 CH2*J=GENES271J 48.3 

65 333968 CH22J : GENES.307_4 15.9 

334223 CH2Z.FGENES.360J 33.5 

334264 CH22J : GENES.367J5 18.5 
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335791 CH2^.FGEhES.61U 27.3 

33S512 CH22_FGENES.834_7 21.4 

338003 CH22.EMAC005500.GENSCAN.127-8 15,2 
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TABLE 14A 



Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

309583 1046029^-2 AW170035 
20 336512 CH22J941FGJ34JLUNKJW 

338008 CH22 W 6490FG_LINK.EMAC00 

333769 CH22J036FG.27U_UNK_.EM 

333968 CH22J245FGJ07_4JJNK_EM 

335791 CH22_3160FG.61U_UNK.EM 

25 309177 AI951118 

332958 CH22J82FG.48J5JJNICB/I: 

334223 CH22.1507FG.360_4_UNK.EM 

334264 CH22_1551FG_367_15.UNK_E 

123619 371681 1 AA602964 AA609200 
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TABLE 14B 

5 Table 14B shows the genomic positioning for those pkeys lacking unigene ED's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

10 Ptey. Unique number corresponding to an Eos prabeset 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham I. et al." refers to the publication 

emitted The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DMA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand NLposition 



332958 Dunham, I. etaL Plus 2516164-2516310 

20 333769 Dunham, I. eUl. Plus 7696625-7696707 

333968 Dunham. I. etai Plus 6681004-6681241 

334264 Dunham, I. etaL Plus 13234447-13234544 

338008 Dunham, I. etai Plus 7697068-7697236 

334223 Dunham, I. etaL Minus 12734365-12734269 

25 335791 Dunham. L etaL Minus 25948563-25948411 

336512 Dunham, I. etai Minus 34278373-34278275 
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TABLE 15: Table 3 from BRCA 001-5 US 
Table 15 shows genes downregulated in breast cancer cells. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unfcene number 

10 Unfejene Title: Unigene gene title 

R1: Rafio of normal breast fissue to tumor 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn UnlGene IDUnigene Title R1 

100115 000632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7 
100499 TIGR:HT1428 Hs.283108 Globin, Beta 1.5 
100502 T1GR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3 
100815 TIGRHT4268 Hs.9739 L^ly(^^-Phospbate:Nad+Qxld<)reduct 1.7 

101125 110373 Hs.82749 transmembrane 4 superfamily member 2 1.5 

101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po 2.9 

101397 M15856 Hs.180878 lipoprotein lipase 1.6 

101883 M98399 Hs.75613 CD36 antigen (collagen type I receptor; thr 1.6 

102227 U25138 Hs.93841 potassium targe conductance calcHim-activ 1.6 

102857 X00129 Hs.76461 retincJ-oinding protein 4; Interstitial 3 

103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8 

103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5 

103562 Z21966 Hs.2815 POU A)main;class6;tnanscn^fonfector1 18 

104672 AA007629 glycerophosphate dehydrogenase 1 (sol 2.4 

105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7 

105138 AA164519 Hs.15248 ESTs 1.5 

106075 AA417915 Hs.25930 ESTs 1.5 

106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6 

107099 AA609645 Hs.211568 eukaryotlc translation initiation factor 4 gam 2.7 

107616 AA004901 Hs.261164 ESTs 1.6 

107997 AA037388 Hs.82223 Human DNA sequence from done 141H5 o 1.7 

108604 AA099820 Hs.49696 ESTs 2.4 

111130 N64265 Hs.19515 yz44h12.s1 Morton Fetal Cochlea Homo sa 1.7 

111837 R36447 Hs.24453 ESTs 1.6 

112538 R70255 ESTs 1.9 

112808 R97970 Hs.281022 EST 1.5 

113086 T40652 Hs.209100 DKFZP434C171 protein 1.9 

115740 AA4 18033 Hs.283559 ESTs 1.6 

115949 AA443800 Hs.43125 ESTs 2 

115965 AA446661 Hs.173233 ESTs Z2 

117224 N20300 Hs.218707 ESTs 1.7 

117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 1.7 

119059 R15436 Hs.77889 Friedreich ataxia region gene X123 1.7 

119175 R71792 Hs.301002 ESTs; Weakly similar to ceil death acOvalo 2.8 

119359 T71021. Hs.285681 ESTs; Highly similar to WS baslc-helix-too 1.9 

119798 W73386 Hs.249129 ESTs 3 

120889 AA365784 Hs.97044 ESTs 1.6 

121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-famfly pro U 

121750 AA421184 Hs.97549 ESTs 1.5 

122127 AA434447 Hs.106771 ESTs 2.5 

122348 AA443695 Hs.293410 ESTs 2.1 

122485 AA448300 Hs.160318 phospholemman 1.5 

123443 AA598841 Hs.167382 natriuretic pepfide receptor A/guarrylatecy 1.8 

123505 AA600135 ESTs; Moderately similar to llll ALU SUB 1.5 

125284 W94688 Hs.103253 penlipin 1.7 

126300 D81972 HUM427D08B Human fetal brain (TFujiw 1.8 

126747 R72515 Hs.160318 phospholemman 1.6 

127218 AA309765 Hs.116017 ESTs; Weakly similar to WAA0795 protei 1.5 

127357 AA452788 Hs.75432 *39g11.r1 Soares.totaLfetus_Nb2HF8^ 1.7 
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127638 AA634405 Hs.122608 ESTs 1.5 

128213 AA972780 Hs.129194 ESTs;WeaMyslrnOartoII!IALUSUBFA 1.5 

128351 AI092391 Hs.134886 ESTs 15 

128842 N44757 Hs.20340 ESTs 1.6 

5 128870 R714G3 Hs.75309 eukaryotic translation elongation factor 2 1.7 

129146 AA459944 Hs.108924 DKFZP586P1422 protein 1.5 

129285 T62068 Hs.11Q06 ESTs 2.1 

129331 N93465 Hs.279772 ESTs; Highly similar to CGW8 protBin (H 1,5 

130085 M62402 Hs.274313 toSn-Qke growth factor binding protein 6 1.7 

10 130400 M25079 H&283108 hemoglobin; beta 1.7 

131267 AA211776 Hs,2504 myomesln 1 (skelemin){185kD) 3.8 

131277 AA131466 Hs.23767 ESTs 1.9 

131282 M12272 Hs.4 alaholdeh^rogefiase3(dassl);gamrnap 22 

131304 AA295848 Hs^5475 equaporin7 1.7 

15 131810 D49487 Hs.1 94236 lerjto (niurine obesity hornolog) U 

132788 AA045503 Hs.56874 ESTs; Weakfy similar to Homo sapiens p2 1.6 

132931 241452 Hs.6090 deletedin bladder cancer chromosome regi 15 

133120 X64559 Hs.65424 tetranectin (piasminogen-bhdtng protein) 2 

133314 U95367 Hs.70725 gamma-aminobutyric acid (GABA) A recep 15 

20 133507 X74295 Hs.74369 integrfn; alpha 7 1.7 

133601 S95936 Hs.284176 transferrin 2.3 

133702 N56898 Hs.75652 glutathione S-transferase M5 1.9 

134111 N79674 Hs.8022 TU3A protein 4.6 

134699 U56814 Ks.86646 deoxynbonuciease l-iike 3 1.5 

25 134749 L10955 Hs.89485 carbonic anhydrase IV 1.6 

135173 M72885 Hs.95910 Human G0S2 protein gene; complete cds 1.9 

300132 AW027556 Hs.156286 ESTs 1.7 

300732 AJ369956 Hs.257891 ESTs 1.5 

300750 AA514805 Hs.293055 ESTs 1.8 

30 301140 AI807692 Hs.129129 ESTs 1.6 

301396 AA923549 Hs.224121 ESTs 2.1 

302910 N77976 Hs.25 1577 hemoglobin; alpha 1 1.8 

303798 V00505 Hs.36977 hemoglobin; delta 1.6 

303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7 

35 303844 U94362 Hs.58589 glycogenln2 1.5 

304182 H91086 EST singleton (not m UniGene) with exon 1.5 

304622 AA516384 EST singleton (not in UniGene) with exon 1.5 

304682 AA550994 EST singleton (not in UniGene) with exon 1.7 

305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5 

40 306193 AA923457 EST singleton (not in UniGene) with exon 1.5 

307206 A1192534 EST singleton (not in UniGene) with exon 1.6 

307377 A1222691 EST singleton (not to UnJGene) with exon 1.5 

308023 AI452732 Hs.251577 EST singleton (not In UniGene) with exon 1.9 

308359 A1612774 Hs.79372 retinoid X receptor; beta 1.5 

45 309838 AW296073 Hs.255504 EST 1.5 

310403 AI720978 Hs.148006 ESTs; Moderately similar to alternatively s 1,8 

311671 AW241947 Hs.232478 ESTs 1.6 

311794 AW238092 Hs.254759 ESTs 2.1 

312082 T79860 Hs.1 18180 ESTs 1.9 

50 312575 H25237 Hs.306814 ESTs 2.3 

313076 N49684 Hs.143040 ESTs 1.8 

313283 W32480 Hs.157099 ESTs 2.2 

3i3374 AW328672 Hs.132760 ESTs 1.9 

314701 AI754634 Hs.131987 ESTs 1.7 

55 315391 AA759098 Hs.192007 ESTs 1.8 

315688 AA680055 Hs.264885 ESTs 1.5 

316249 AA948612 Hs.130414 ESTs 1.6 

316586 A1205077 Hs.294085 ESTs 1.7 

316890 AA837079 Hs.24647 ESTs 1.5 

60 316983 AI480204 Hs.177131 ESTs 1.5 

317604 AI650625 Hs.300756 ESTs 1.6 

317951 AW2C6520 Hs.129621 ESTs 1.5 

319400 W26902 Hs.154085 ESTs 1.7 

320757 H22654 Hs.6382 EST cluster (not In UniGene) 1.5 

65 321594 AA021402 Hs.11067 ESTs 1.7 

322102 H45589 EST cluster (not In UniGene) 1.5 

322814 AI824495 Hs^11038 ESTs 2.2 
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322929 A1365585 Hs.146246 ESTs 2.3 

323831 AA335715 Hs.200299 ESTs 1.7 

324044 AL045752 Hs.22350 ESTs 1.8 

324675 AW014734 Hs.157969 ESTs 11 

5 325272 CH.11_hsgij5866902 1.5 

325558 CH.12_hsgi|6Q56302 1.6 

325656 CH.14J»sgi|6056305 1.6 

326120 CH.17_hsgi|5867194 1.5 

326139 CH.17JsgP67203 15 

10 326855 CH.20Jis #552460 15 

327438 CR02_hsgi[6004454 16 

329733 CH.14jj2gi|6065783 1.6 

330931 F01443 Hs.284256 ESTs 4.6 

331591 N71677 Hs.42146 ESTs 15 

15 332159 AA621393 Hs.112984 EST 15 

332364 W94688 Hs.103253 periHpin 2.1 

332502 H21819 Hs.14896 Homo sapiens done 24590 mRNAsequenc 1.5 

334175 CH22JGENES.349J0 15 

334347 CH22.FGENES.375J1 1.8 

20 334737 CH22_FGENES.424J2 18 

335352 CH22LFGENES.539.5 15 

335639 CH22_FGENES584J9 16 

336244 CH22_FGENES.746_2 15 

336336 CH22JGENES514J 17 

25 336865 CH22_FGENES.305-1 16 

337494 CH22J=GENES.799-12 16 

337764 CH22_EMAC000097.GENSCAN.119-1 18 

337983 CH22JEMACX)05500.GENSCAN.110-1 2 

338192 CH2«IVtAC005500.GENSCAN.228-1 15 

30 339366 CH22_BA354l12.GENSCAN.34-2 15 
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TABLE 15A 



Table 15A shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



126300 250375J D81972 BE003132 
20 112538 504579 J AA908813 R70255 

123505 genbank_AA600135 AA600135 - 

104672 6735.7 AA349096 A136B018 F21390 F17759 R48772 A1421485 A1300352 H43971 A1378525 F33652 R47898 AI264177 F22289 N28263 



AI276281 R48205 AI245302 All 90036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 



25 322102 46708J H45589H19807AF075038H19808 H42437 

336865 CH22.4590FG.305J. 

338192 CH2^6755FG_UNI^EMAC00 

329733 c14j>2 

326120 c17_hs 
30 326139 c17Jis 

326855 c20_hs 

335352 CH22_2699FG.539.5_UNK_EM 

335639 CH22J999FG.584_19J.INK_E 

307206 AJ192534 
35 307377 A1222691 

337494 CH22_5727FG_799JZ_ 

337764 CH22J115FG_UNK_EMAC00 

337983 CH2^6438FG_UNieEMAC00 

339366 CH22_6336FG_LINK^BA354I1 
40 325272 c11Jis 

325558 c12_hs 

325656 c14_hs 

334175 CH22J455FG.349J0JJNK-E 
304182 H91086 
45 334347 CH22_1640FG_375_31_UNK.E 
327438 cJJ\s 
304622 AA516384 

334737 CH22JMSrGJ24J2JuXKJ. 
304682 AA550994 
50 336244 CH22_3642FG_746JUUNICDA 
306193 AA923457 

336336 CH2^3746FG.814_8_UNK_BA 



10 



Pkey: 

CAT number 
Accession: 



Unique Eos probeset identifier number 
Gene duster number 
Genbank accession'numbers 



15 



Pkey CAT number Accession 
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TABLE 15B 



Table 15B shows the genomic positioning for those pkeys lacking unigene CD's and 
accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers in this column are Genbank identifier (Gl) numbers, "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human dvomosome 2T Dunham I. et aL, Nature (1999) 402:489-495. 
Strand: indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand NLposition 



334347 
334737 
335639 
337494 
334175 



336244 
336336 
336865 
337764 
337983 
338192 
339366 
325272 
325558 
325656 
329733 
326120 
326139 



327438 



Dunham, I. etal. 
Dunham, I. et.al. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, L etal. 
Dunham, I. eta). 
Dunham, L etal. 
Dunham, I. etat. 
Dunham, I. etal. 
5866902 
6056302 
6056305 
6065783 
5867194 
5867203 
6552460 
6004454 



Plus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Plus 

Minus 

Minus 

Minus 



13663814-13663926 

15998517-15998685 

25173591-25173696 

33339024-33339148 

11668659-11668597 

22681512-22681384 

3140272941402583 

33797209-33797076 

8622405-8622289 

40356404035446 

7275495-7275271 

13248453-13248277 

33647431-33647293 

13247-13312 

70930-71030 

78190-78707 

163237-163450 

3611646276 

218901-218960 

111390-111463 
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TABLE 16: Table 4 from BRCA 001-5 US 

Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
breast cancer cells. 



Ptey: Unique Eos probeset identifier number 

ExAccn: . Exemplar Accession number, Genbank accession number 

10 UnigenelO: Unlgene number 

Unlgene Title: Unigene gene fide 

R1: Ratio of normal breast tissue to tumor 



15 Pkey ExAccn 



20 



25 



30 



35 



40 



45 



100502 
101367 
102857 
104672 
107099 
108604 
115949 
115965 
119175 
119798 
122127 
122348 
129285 
131267 
131282 
131810 
133120 
133601 
134111 
301396 
311794 
312575 
313283 
322814 
322929 
324675 
330931 
332364 
337983 



TIGR:HT1496 

M12963 

X00129 

AA007629 

AA609645 

AA099820 

AA443800 

AA446661 

R71792 

W73386 

AA434447 

AA443695 

T62068 

AA211776 

M12272 

D49487 

X64559 

S95936 

N79674 

AA923549 

AW238092 

H25237 

W32480 

AI824495 

AI365585 

AW014734 

F01443 

W94688 



UniGene tD Unigene Tide Rt 

Hs.169228 Adrenal-SpecSic Protein Pg2 2.3 

Hs.4 alcohol dehydrogenase 1 (class l); alpha 2.9 

Hs.76461 retinol-binding protein 4; interstitial 3 

gtyceroW-phosphate dehydrogenase 1 2.4 

Hs.211566 eukaryotlc translation initiation factor 4 gam 2.7 

Hs.49696 ESTs 2.4 

Hs.43125 ESTs 2 

Hs, 173 233 ESTs 22 

Hs.301002 ESTs: Weakly similar to ceB death activator £8 

Hs.249129 ESTs 3 

Hs.106771 ESTs 2^ 

Hs.293410 ESTs 2.1 

Hs.11006 ESTs 2.1 

Hs.2504 myomesin 1 (skelemin) (185kD) 3.8 

Hs.4 alcohol dehydrogenase 3 (class i); gamma 2.2 

Hs.194236 leptln (murine obesity homotog) 2.5 

Hs.65424 tetranectin (plasm!nogen-bir«fiig protein) 2 

Hs.284176 transferrin 2.3 

Hs.8022 TU3A protein 4.6 

Hs.224121 ESTs 2.1 

Hs.254759 ESTs Z1 

Hs.306814 ESTs 2.3 

Hs.157099 ESTs 2.2 

Hs.211038 ESTs 12 

Hs.146246 ESTs 2.3 

Hs.157969 ESTs 22 

Hs.284256 ESTs 4.6 

Hs.103253 perilipln 2.1 

CH22LEMAC005500.GENSCAN.110-1 2 
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TABLE 16A 



Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubieTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



CAT number: Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 6735J AA349096 AJ368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 
AI276281 R48205 A1245302 A1190036 A1281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 



Pkey: 



Unique Eos probeset identifier number 
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TABLE 17: Table 1 from BRCA 014 P 

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amount of expression in breast tumors versus normal breast 
tissue. 



10 



15 



Pkey: Unique Eos probeset Identifier number 

ExAccti: Exemplar Accession number, Genbank accession number 

Unfgen8(D: Unigene number 

UnJgene Title: Unigene gene title 

R1: Ratb of tumor to normal breast tissue 



Pkey ExAccn UnfgenelD Unigene Title 



R1 



20 100227 AV654694 Hs.82316 interferon-lnduced, hepatitis C-associat 3 

100405 AW291587 Hs.82733 nldogen2 3.2 

100406 AI382060 Hs.1 18397 AE-binding protein 1 3.6 
100420 D86383 Hs.1 18893 Melanoma associated gene 3.2 
100911 X83300 Hs.289103 SMA4 5.2 

25 100960 J00124 Hs.1 17729 keratin 14 {epidermolysis bullosa simple 4,3 

101011 BE387036 Hs.1211 acid phosphatase 5, tartrate resistant 3 

101183 AA442324 Hs.795 H2A histone family, member 0 3.2 

101194 120971 Hs.1 88 phosphodiesterase 4B, cAMP-specific (dun 3 

101329 U66042 Hs.82171 Homo sapiens clone 191 B7 placenta expres 4.1 

30 101378 BE563085 Hs.833 interferoivstimulaled protein, 15 kDa 5.3 

101474 R07566 Hs.73817 small tndudbie cytokine A3 (homologous 3.9 

101491 M25809 Hs.64173 ATPase. H+ transporting, lysosomal (vacu 4.5 

101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9 

101602 AA353776 Hs.901 C048 antigen (B-cell membrane protefo) 3.4 

35 101663 NMJ1G3528 Hs.2178 H2B histone family, member Q 5.6 

101758 BE019494 Hs.79217 pyrroline^-carboxylate reductase 1 3.6 

101767 M81057 Hs. 180884 carboxypeptidase B1 (tissue) 12 

101817 M89907 Hs.1 52292 SWl/SNF related, matrix associated, act] 3.2 

101851 BE260964 Hs.82045 midkine (neurite growfivpromofing factor 4.1 

40 101878 M97815 Hs.183650 ceflular retinolc ackttinding protein 2 6.5 

102209 NM.002038 Hs.265827 Interferon, alpha-lndudble protein (do 3 

102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11 3 

102297 NMJ3015O4 Hs.198252 G protein-coupled receptor 9 3.7 

102299 ML005824 Hs.1 55545 37 kDa leudne-rich repeat (IRR) protein 3.7 

45 102301 NMJJ05651 Hs.183671 tryptophan 2,3-dioxygenase 5.2 

102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homotog) 3.5 

102369 U39840 Hs.299867 hepatocytB nuclear factor 3. alpha 3.9 

102591 U62325 Hs,324125 amyloid beta (A4) precursor orotein-blnd 4 

102721 H16646 Hs.1 18666 hypothetical protein PP591 3.5 

50 102739 AA363025 Hs.155572 Human done 23801 mRNA sequence 3.2 

102791 AF080229 gb:Human endogenous retrovirus K done 1 3 

102804 NMJJ02318 Hs.83354 lysyl oxidase-Cke 2 3.2 

102903 M73779 Hs.250505 retinoic add receptor, alpha 3.3 

103010 X52509 Hs.161640 tyrosine aminotransferase 12.4 

55 103042 T81656 Hs.252259 ribosomal protein S3 4.5 

103117 X63578 Hs.295449 parvalbumin 3 

103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9 

103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9 

103284 AI751601 Hs.8375 TNF receptor-associated factor 4 a3 

60 103329 X85134 Hs.72984 retinoblastoma-blndlng protein 5 3.1 

103364 X90872 Hs^79929 gp25L2 protein 3 

103385 NMJW7069 Hs.37189 similar to rat HREV107 3.4 

103456 AA496425 Hs5629 papillary renal cell cardnoma (transloc 3.2 
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103498 Y09306 Hs.30148 homeodomaWnteracting proteJn kinase 3 3.4 
103558 BE616547 Hs.2785 keratin 17 3.7 
103563 L02911 Hs.150402 Activln A receptor, type I (ACVR1)(ALK 3.2 
103612 BE336654 Hs.70937 H3 histone family, member A 4.5 
5 103825 AI571835 Hs.55468 ESTs 4 
104073 AW779318 Hs.88417 ESTs 3.8 
104103 AW021102 Hs21509 ESTs 4.3 
104115 AF183810 Hs.26102 opposite strand to trichorhlnophalangeal 7.6 
.104168 AA461618 Hs.31704 ESTs, Weakly simitar to K1AA0227 IhLsapi 3.6 

10 104173 AA084273 Hs.76561 ESTs, Weakly simitar to S47072 finger pr 4 
104181 AF173296 Hs.283740 DC6 protein 3 
104189 AB040927 Hs.301804 KIAA1494 protein 3.2 
104269 AI559444 Hs.293960 ESTs 4.3 

104307 AJ929700 Hs.1 11680 endosulfine alpha 3.1 

15 104518 H20816 Hs.1 12423 Homo sapiens mRNA; cDNADKFZp586!1 420 (f 3.2 

104556 AV650851 Hs.96900 hypoftetfcai protein; KIAA1830 protein 4.4 

104658 AA360954 Hs.27268 Homo sapiens cDNA: FU21933 fis, clone H 3.2 

104748 AA015879 Hs.33536 ESTs 3.2 

104755 T49951 Hs.9029 OKFZP434G032 protein " 4.5 

20 104825 AA035613 Hs.141883 ESTs 6.9 

104830 AW294092 Hs.21594 hypothetical protein MGC1 5754 11.1 

104865 T79340 Hs.22575 B-cefl CUJymphoma 6, member Bfztncfi 3.5 

104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5 

104961 H78517 Hs.33905 ESTs 3.6 

25 105038 AW503733 Hs.9414 KIAA1488 protein 4.5 

105088 H58589 Hs.35156 Homo sapiens cDNA RJ1 1027 fis, done PI 3.8 

105092 AA148982 Hs.29068 ESTs 3 

105093 AL137566 Hs.32405 Homo sapiens mRNA; cONA DKFZp586GQ321 (f 4.8 
105304 AW134924 Hs.190325 ESTs 8.2 

30 105397 AA814807 Hs.7395 hypothetical protein FU23182 3.1 

105409 AW5O5076 Hs.301855 DiGeorge syndrome critical region gene 8 4.2 

105431 AA252033 Hs.242413 hypothetical protein DKFZp434K1421 4.4 

105552 AA256750 Hs.28802 centeurtn-alpha 2 protein 3.2 

105598 AA279439 Hs.279763 hypothetical protein FU10504 3.5 

35 105650 W16741 Hs.25635 HSPC003 protein 3.7 

105688 AI299139 Hs.17517 ESTs 5.5 

105808 AI133161 Hs.286131 CGi-101 protein 3.5 

105809 AW973653 Hs.20104 hypothetical protein FU00052 3.3 
105909 AA195191 Hs.5111 hypothetical protein FU20729 3.2 

40 105965 AA131657 Hs.23830 ESTs 3.3 

106135 AL1 17474 Hs.41181 Homo sapiens mRNA; cDNA OKFZp727C191 (fr 3.2 

106184 W28948 Hs.10762 ESTs 3.3 

106293 N39842 Hs.301444 KIAA1673 4.1 

106400 BE397649 Hs.94109 Homo saptens cDNA FU13634 fis, clone PL 3.1 

45 106474 BE383668 Hs.42484 hypothetical protein aJ 1061 8 3.2 

106484 AA351978 Hs.4943 hepatocellular carcinoma associated prot 7.8 

106533 AL134708 Hs.145998 ESTs 3 

106614 AA648459 Hs.335951 hypothetical protein AF301 222 3.8 

106636 AW958037 Hs.286 ribosomal protein L4 33 

50 106661 AW499914 Hs.7579 hypothetical protein RJ 10402 3 

106743 BE613328 Hs.21938 hypothetical protein FU12492 4.2 

106844 AA465055 Hs.156213 sperm associated antigen 6 3.4 

106864 AJ31 1928 gb:qo89h04.x1 NCLCGAPJQd5 Homo sapiens 4.4 

106865 AW192535 Hs.19479 ESTs 3.6 
55 106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1 

106942 AA995351 Hs.31314 retinoblastoma-blnding protein 7 3.6 

106968 AF216751 Hs.26813 CDA14 5.3 

107105 AW963419 Hs.155223 stanniocalcln 2 3.4 

107158 N32849 Hs.31844 hypothetical protein FU12586 3.1 

60 107248 AW263124 Hs.315111 nuclear receptor co-repressoc(HDAC3comp 5.9 

107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.9 

107630 AW961576 Hs.60178 ESTs 4.6 

107710 AI955040 Hs.265398 ESTs, Weakly similar to transformations 3 

107890 AA025386 Hs.61311 ESTs, Weakly similar to S 10590 cysteine 3.1 

65 107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA OKFZp564F053 (fr 4.8 

108000 A1263307 Hs.239884 H2B histone family, member L 33 

108217 AA058686 Hs.62588 ESTs 3.8 
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108435 T82427 Hs.194101 Homo sapiens cONA: FU20869 fis, dons A 3 

108591 AB033073 Hs.43857 simitar to glucosarrune^sulfatases 33 

108733 AA121022 gb:zn84f10j1 Stratagene tung carcinoma 3.9 

108771 AF068290 Hs.79741 hypothetical protein FU 101 16 6.1 

5 108819 AA011449 Hs.271627 ESTs 3.6 

108912 AA136674 Hs.1 18681 EST 3.9 

109086 AF186114 Hs.270737 tumor necrosis (ador (Dgand) supenam! 3.7 

109124 AK000684 Hs.183887 hypothetical protein FU22104 3.1 

109132 AI970536 Hs.16603 hypothetical protein FU13163 3.7 

10 109163 N23235 Hs.30567 ESTs, Weakly similar to B34087 hypothec 4.5 

109277 AA196443 Hs.86043 Homo sapiens cONA FU13558 fis, done PL 3.7 

109410 AW504732 Hs.21275 hypothetical protein FU1101 1 4.6 

109454 AA232255 Hs.295232 ESTs, Moderately similar to A46010 X-fin 6.4 

109514 AA234087 Hs.262346 ESTs, WeaJdy similar to S72482 hypofteti 4.8 

15 109581 R45584 Hs.23025 ESTs. Weakly similar to ALU5.HUMAN ALU S 3.3 

109632 AA325138 Hs.235873 hypothetical protein FU22672 3 

109644 AW973964 Hs.291531 ESTs, Highly similar to 1203217A dehydrc 3 

109700 F09609 gb^SC33H092 normalized Infant brain cON 3.2 

109768 F06838 Hs.14763 ESTs 3.2 

20 109807 R43646 Hs.12422 ESTs 3.8 

109842 AW818436 Hs.23590 solute carrier family 16 (monccarboxyDc 3.3 

109895 AK001680 Hs.30488 DKFZP434F091 protein 3.6 

110024 AW973152 Hs.31050 ESTs 4.2 

110561 AA379597 Hs.5199 HSPCISOprotemsimila/touW^iflrwon 5.1 

25 110675 H89355 Hs.2491 59 adrenergic, alpha-2A-, receptor 5.3 

110707 A1239832 Hs.15617 ESTs, Weakly similar to ALU4JWMAN ALU S 3,7 

110915 BE092285 Hs.29724 hypothetical protein FU13187 3.7 

111139 N64683 Hs.290943 ESTs 4 

111155 N66563 Hs.191358 ESTs 3.1 

30 111199 AJ767435 Hs.29822 ESTs 4.5 

111336 AI457338 Hs.29894 ESTs 5.4 

111510 R07856 Hs.16355 ESTs 3.2 

111532 R08440 gb:yf19f09.s1 Soares fetal Over spleen 3.1 

111689 AA6Q2004 Hs.23260 ESTs 3.2 

35 111823 R35252 Hs.24944 ESTs, Weakly similar to 2109260A B ceU 3.3 

111876 R38239 Hs.293246 ESTs. Weakly similar to putative p150(H 3.1 

111892 AA421081 Hs.12388 ESTs 3.4 

111893 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 3.3 
112125 AW379029 Hs.118338 ESTs, Weakly similar to unnamed protein 4.4 

40 112170 BE246743 Hs.288529 hypothetical protein FU22635 7.3 

112287 AB033064 Hs.334806 K1AA1238 protein 3.2 

112300 H24334 Hs.26125 ESTs 4.4 

112303 R54797 gb:yg87b07.s1 Scares infant brain 1NIB H 3.4 

112478 R66067 Hs.28664 ESTs 8.2 

45 112561 A1791493 Hs.129873 ESTs, Weakly similar to A36036 cytochrom 5.5 

112631 R82040 gbryj06b06^1 Soares placenta Nb2HP Homo 3.9 

112637 R82331 Hs.164599 ESTs 5.4 

112657 AW844878 Hs.19769 hypothetical protein MGC4174 3.2 

112678 A1418466 Hs.33665 ESTs 4.7 

50 112917 AA082465 Hs.125031 (^ollne/eftajiolair^ephosphotrarttferase 3.7 

113070 AB032977 Hs.6298 KIAA1151 protein 3.1 

113095 AA828380 Hs.126733 ESTs 3.4 

113117 AW813731 Hs. 1591 53 ESTs. Moderately similar to S65657 alpha 3.4 

113187 BE613410 Hs.31575 SEC63, endoplasmic reticulum translocon 3.2 

55 113200 T57773 Hs.10263 ESTs 3.5 

113206 BE262470 Hs.241471 RNB6 6.2 

113374 T79925 Hs.269165 ESTs, WeaWy similar to ALU1 JHUMAN ALU S 3.7 

113440 U54727 Hs.191445 ESTs 3 

113494 T91451 Hs.86538 ESTs 3.4 

60 113518 AW367788 Hs.323954 postmelotic segregation Increased 24Ike 3.1 

113571 A1702609 Hs.15713 hypothetical protein MGC2776 3.1 

113822 NMJ04585 Hs.17466 retlnoic add receptor responder (tazaro 3.9 

113835 AI912410 Hs.27475 Homo sapiens cDNA FU1 2749 fis, done NT 3 

113938 W81598 gb:zd88g02.s1 ScarK^fetaUmnLNbHH19W 4.6 

65 113947 W84768 gb:zh53d03.s1 Soares.fetaLBveLspleeru 3.1 

113970 W27249 Hs.8109 hypothetical protein FU21080 6.9 

114086 AA378776 Hs.288649 hypothetical protein MGC3077 4.3 
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114148 AW470411 Hs.288433 neurotrimln 4.1 

114424 AW780192 Hs.267596 ESTs 3.4 

114518 AW163267 Hs.106469 suppressor of varl (S.rerevlsiae) 3-fflce 3.1 

114563 AI979168 Hs.82228 glycoprotein (transmembrane) nmb 4.8 

5 114965 AI733881 Hs.72472 BMP-R1B 10,1 

114995 AA769266 Hs. 193657 ESTs 3.6 

115121 AI634549 Hs.88155 ESTs 3.2 

115134 AW968073 Hs.194331 ESTs, Highly similar to A5571 3 Inositol 4.2 

115167 M749209 Hs.43728 hypothetical protein 3 

10 115253 BE149845 Hs.289038 hypothefical protein MGC4126 3.6 

115277 AA814100 Hs.86693 ESTs 3.9 

115327 N46436 Hs.109221 ESTs 3.4 

115354 AA281636 Hs.334827 ESTs 4.8 

115657 AA405620 Hs.55158 ESTs, Weakly similar to T29520 hypotheti 3.5 

15 115676 AA953006 Hs.88143 ESTs 9.3 

115709 AW293849 Hs.58279 ESTs, WeaWy similar to ALU7_HUMAN ALUS 3.4 

115729 AA417812 Hs.38775 ESTs 4 

115787 AI126772 Hs.40479 ESTs 3.1 

115830 AW970529 Hs.86434 hypothetical protein FU21 81 6 3.6 

20 115835 AA521410 Hs.41371 ESTs 3.1 

115850 NM.014937 Hs.52463 KIAA0966 protein 3 

115900 AK001500 Hs.165186 hypothetical protein RJ13852 3.2 

115935 AA354549 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C 191 (fr 3 

115948 AL042465 Hs.43445 poly{A>spedric ribonuctease {deadenyla 3.1 

25 116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 { 6.7 

116115 AL042355 Hs.70202 WD repeat domain 10 3.6 

116184 AW450737 Hs.128791 CGt-09 protein 3.1 

116192 AA464976 Hs.62528 ESTs, Moderately similar to A46010 X-iin 3.3 

116208 AI219083 Hs.42532 ESTs, Moderately similar to ALU8 HUMAN A 3.2 

30 116246 AF265555 Hs.250646 bacutoviral IAP repeat-containing 6 3.6 

116443 AW962196 Hs.321264 LBP protein 32 4.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.1 

116726 AK001114 Hs.53913 hypolhetical protein FU10252 8.6 

116845 AA649530 gb:ns44f05.$1 NCLCGAPJUvl Homo sapiens 3.2 

35 117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75 HUMAN Z 3.5 

117216 AI569804 Hs,42792 ESTs, Weakly similar to I78885 serine/th 3.1 

117296 AL133427 Hs.42506 Homo sapiens mRNA full length Insert cDN 3.2 

117403 H84455 Hs.40639 ESTs 4.7 

117691 AB040959 Hs.93836 DKFZP434N014 protein 3 

40 118229 AW968941 Hs. 166254 hypothetical protein DKFZD566I1 33 3.3 

118363 AI183838 Hs.48938 hypothetical protein FU21 802 4.3 

118416 N66028 Hs.49105 FKBP-associated protein 3.1 

118470 AW970584 Hs.291033 ESTs 3.4 

118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 5.2 

45 118695 AK000465 Hs.50081 K1AA1 199 protein 3.4 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8.HUMAN A 3.3 

119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 19.7 

119036 R95872 Hs.117572 chemoklne binding protein 2 3.7 

119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1J4UMAN A 4.1 

50 119075 M10905 Hs.287820 fibronectinl ~ 3.2 

119620 W47620 Hs.56009 Z-S'-oKgoadenyiate synthetase 3 {100 k 3.3 

119741 AF041853 Hs.43670 kinesin family member 3A • 3.1 

119747 A1970797 Hs.64859 ESTs 5 

119754 AL037824 Hs.194695 ras homolog gene family, member I 3.8 

55 119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehiers-Danl 3.1 

120084 W94472 Hs.59529 ESTs, Moderately similar to ALU 1 HUMAN A 8.4 

120241 AA825686 Hs.321176 ESTs, Weakly simitar to S65824 reverse t 3.6 

120326 AA196300 Hs.21145 hypothetical protein RG083M05.2 3.2 

120742 AA225084 gfcnc21d06.r1 NCLCGAPJtt Homo sapiens 3.6 

60 120870 AA357172 Hs.292581 ESTs, Moderately similar to ALU1 HUMANA 5.8 

120885 AA365515 Hs.301872 hypothetical protein MGC4840 " 3 

120970 AA398118 Hs.97579 ESTs, Weakly simflar to A46010X-linked 3.7 

121054 AW976570 Hs.97387 ESTs S3 

121095 AA320134 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 4 

65 121103 AA398936 Hs.97697 EST 3.5 

121121 AA399371 Hs.189095 similar to SALL1 (sal (DrosophiIa)-like 6,3 

121337 AW885727 Hs.301570 ESTs 4.7 
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121351 AW206227 Hs.287727 hypothetical protein FU23132 5 

121611 M31669 Hs.1735 tohibin, beta 8 (activin AB beta polypep 3.6 

121643 AA640987 Hs.193767 ESTs 5.6 

121770 NM.0159G2 Hs.276428 progestin Induced protein 3.4 

5 122125 AK000492 Hs.98806 hypoftetical protein 4.1 

122338 AA443311 Hs.98998 ESTs 3 

122417 AA446965 Hs.112092 ESTs 4.7 

122513 AJ767879 Hs.99214 ESTs 3,8 

122544 AW973253 Hs.292689 ESTs 3 

10 122655 AA323296 H&97837 Homo sapiens mRNA; cDNA DKFZp547 J047 (fr 5.6 

122605 AA526911 Hs.82772 collagen, type XI. alpha 1 3.2 

122851 AW205931 Hs.99598 hypothetical protein MGC5338 8.6 

123105 AA487809 Hs.166011 catenin (cadierin-associated protein), d 3 

123111 AA228776 Hs.191721 ESTs 6.9 

15 123249 AA371307 Hs.125056 ESTs . 3.6 

123273 AA491253 Hs. 173611 Empirically selected from AFFX single pr 7 

123385 BE149685 Hs.17767 KIAA1554 protein 3.1 

123419 T66087 Hs.1 12482 Homo sapiens unknown tnRNA sequence 3.4 

123485 AI308876 Hs.103849 hypothetical protein DKFZp761D1 12 3.1 

20 123645 AJ675944 Hs.188691 Homo sapiens cDNA RJ12033 fis, done HE 3.8 

123819 AA580082 Hs.112264 ESTs 4.7 

124012 AA352723 Hs.241471 RNB6 3.8 

124243 H69125 Hs.133525 ESTs 4.1 

124357 N22401 gb:yw37g07.s1 Morton Fetal Cochlea Homo 4.1 

25 124359 N225Q8 Hs.13931 5 Homo sapiens cONA: FU21479 fis, dona C 3.6 

124567 AW451645 Hs.151504 Homo sapiens cDNA RJ11973 fis, clone HE 3.1 

124911 N34151 Hs.174195 interferon Induced transmembrane protein 3.5 

124972 R41396 Hs.101774 hypothetical protein FU23045 4.3 

125005 BE065136 Hs.145696 spGdng factor (CC1 .3) 6 

.30 125042 T78906 Hs.269432 ESTs t ModeraterysirnflartoAUJ1J1UMAN A 8.1 

125184 W60326 Hs.288684 Homo sapiens cONA FU11750 lis. clone HE 4.7 

125243 AW970536 Hs.105413 ESTs 3.1 

125286 AF086534 Hs.187561 ESTs, Moderately similar to ALlMJiUMAN A 3.3 

125304 AL359573 Hs.124940 GTP-binding protein 3 

35 125330 AW880562 Hs.1 14574 ESTs 3 

125331 AI422996 Hs.161378 ESTs 3.2 

125685 AI924630 Hs.4943 hepatocellular carcinoma associated prot 3.2 

126257 N99638 gb:za39g11.r1 Scares fetal liver spleen 4 

126474 AW975814 Hs.326714 Homo sapiens done IMAGE:713177. mRNA se 4 

40 126666 AA648886 Hs.151999 ESTs 3.8 

126872 AW450979 gb:Ul-H-BI3^-12-0-Ul.s1 NCLCGAP_Su 3 

127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259 ferri 3.6 

127980 AA961459 Hs.125644 ESTs 4.1 

127997 AW068311 Hs.31 1054 Homo sapiens mRNA full length Insert cON 3.3 

45 128420 AA650274 Hs.41296 ftbranectln leucine rich transmembrane p 4.6 

128609 NMJJ03616 Hs.102456 survival of motor neuron protein fnterac 3.9 

128946 Y13153 Hs.107316 kynurenine 3-monooxygenase (kynurenine 3 3.1 

128955 AA775076 Hs.185807 Homo sapiens, SlmBar to PRO0478 protein 3.9 

129092 056365 Hs.63525 pory{rCH)inding protein 2 3.3 

50 129270 AA357185 Hs.109918 ras homotog gene family, member H 3.1 

129301 AF182277 Hs.330780 cytochrome P450, subfamily IIS (phenobar 3.9 

129385 AA172106 Hs.1 10950 Rag C protein ^ ' 6.2 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.4 

129629 AK000398 Hs.1 1747 hypothetical protein FU20391 3 

55 129725 X56411 Hs.1219 alcohol dehydrogenase 4 (dass II), pip 3.2 

130069 AI754813 Hs. 146428 collagen, type V, alpha 1 5.4 

130092 X03363 Hs.323910 Vnert>b2 avian erythroblastic leukemia v 4.4 

130298 AI347487 Hs.1 32781 dass I cytokine receptor 4.6 

130382 NM.0G3450 Hs.155204 zinc finger protein 174 5.6 

60 130622 A1582291 Hs.16846 ESTs, WeaWy similar to 04HUD1 debnsoqu 3 

130703 R77776 Hs.18103 ESTs . 3.8 

130881 AA809875 Hs.25933 ESTs 4.2 

130954 AB014544 Hs.21572 WAA0644 gene product . 47 

131095 AI399653 Hs.22917 ESTs 4.3 

65 131153 H09048 Hs.23606 ESTs 3.8 

131253 R71802 Hs.24853 ESTs 3.5 

131372 AW293399 Hs.1 44904 nudear receptor co-repressor 1 3.6 
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131507 AI826268 Hs.27769 ESTs, Weakly similar to MCAT_HUMAN M1TOC 3.2 

131587 AI695549 Hs.183868 glucuronidase, beta 3.1 

131739 AF017986 Hs.31386 secreted frizzfec^ted protein 2 3.2 

131795 BE501849 Hs.32317 hlgtwnoblUty group 20B 3.2 

5 131970 D86960 Hs.3610 K1AAQ205 gene product 3.6 

131988 NM.002314 Hs.36566 UM domain kinase 1 3.2 

132093 AA400091 Hs.39421 ESTs 3.2 

132122 AA426202 Hs.40403 Cbp/p3004nteracling trarsacfoator, wit 3 

132159 D76435 Hs.411S4 Zk family member 1 (odd-paired Drosophi 3.2 

10 132333 AA192669 Hs.45032 ESTs 3.5 

132406 AL133731 Hs.4774 Homo sapiens mRNA;cDNADKFZp761C1712(f 9.2 

132482 AV660345 Hs.238126 CGI-49 protein 8.2 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 3.4 

132624 AA326108 Hs.33829 bHLH protein DEC2 3.2 

15 132700 AA319233 • Hs.5521 ESTs 4.8 

132725 NIO06276 Hs.184167 sptog factor, afghirie/serirw-rich 7 3.6 

132799 W73311 Hs.169407 SAC2 (suppressor of actin mutations 2, y 3.2 

132847 T48195 Hs.58189 eukaryotic translation initiation factor 3.5 

132857 Y00272 Hs.184572 ceil division cycle 2, G1 to S and G2 to 4.4 

20 132936 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear trans! 4.8 

133130 AI128606 Hs.6557 zinc finger protein 161 3.3 

133142 AW952412 Hs.65874 ESTs, Weakiy similar to A40348 Elav/Sex- 3.5 

133167 AW162840 Hs.6641 kinesln family member 5C 4.5 

133225 AW600291 Hs.6823 hypothetical protein RJ 10430 3.3 

25 133274 AA085191 Hs.6949 hypothetical protein MGC1 1275 3 

133275 293241 Hs.239934 CGI-96 protein 4.5 

133287 AW797437 Hs.69771 B-factor, properdin 4.1 

133376 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 5.1 

133462 AW675064 Hs.73875 fumarylacetoacetate hydrolase (fumarylac 3 

30 133740 AW162919 Hs. 1 701 60 RAB2, member RAS oncogene family-like 3.4 

133831 BE274552 Hs.76578 protein Inhibitor of activated STAT3 3.9 

133976 AI908165 Hs.169946 GATA-bindlng protein 3 (T-cetl receptor 6.2 

134666 BE391929 Hs.8752 transmembrane protein 4 3.1 

134710 AI433797 Hs.8889 serine hydroxymethyltransferase 1 (sotub 3 

35 134731 D89377 Hs.89404 mshfDrosophfla)homeoboxhornoiog2 5.8 

134776 J05582 Hs.89603 mucin 1, transmembrane 4 

135230 AF064804 Hs.96757 suppressor of Ty(S.cerevi$be) 3 hornoto 3.2 

135303 R61253 Hs.98265 K1AA1877 protein 3.3 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 4.8 

40 135411 L10333 Hs.99947 reticulonl 3.8 

300089 AI199738 Hs.208275 ESTs. Weakly similar to ALUAJWMAN 111! 3.8 

300233 AW614220 Hs.189402 ESTs A2 

300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9 

300256 AW591433 Hs.296241 Transmembrane protease, serine 3 4.9 

45 300378 Z45270 Hs.235873 hypothetical protein RJ22672 3.4 

300973 AA572949 Hs.207566 ESTs 3.5 

301111 R10799 Hs.191990 ESTs 3.8 

301341 AA887601 Hs.208229 G protein-coupled receptor 13.9 

301548 AJ091631 Hs.203845 two pore potassium channel KT3.3 4.4 

50 301884 AA312082 Hs.1 05445 GDNF family receptor alpha 1 5.7 

301936 U79745 Hs.1 14924 solute carrier family 16 (rronocarboxytic 8.6 

301976 T97905 gb:ye54c10,r1 Scares fetal Dver spleen 3.9 

302001 AB020711 Hs.278346 K1AA0904 protein 7.7 

302067 BE542706 Hs.222399 CEGP1 protein 7.3 

55 302094 AW749321 Hs.6786 ESTs 3.3 

302099 AL049670 Hs.137576 ribosomal protein L34 pseudogene 1 4.2 

302145 NMJ0O3613 Hs.1 51407 cartilage intermediate layer protein, nu 7.9 

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F1 12 (fr 5.6 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase {c-erb-b2, 5.4 

60 302290 AA179949 Hs.1 75563 Homo sapiens mRNA; cDNA OKFZp564N0763 (f 34.1 

302372 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 6.7 

302378 AL109712 Hs.296506 Homo sapiens mRNA full length insert cON 4 

302384 AI678059 Hs.202676 synaptonemal complex protein 2 4.3 

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin famfly member) 13.8 
65 302680 AW192334 Hs.38218 . ESTs 9.6 

302830 AI038997 Hs.132921 ESTs 5 

302857 AF282265 Hs.44836 inner centromere protein antigens (135kO 3.4 
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302892 AW176909 Hs.42346 calaneumhblirtmg protein calsarcin-1 3.4 

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019 dyneln he 5.1 

303271 AA652687 Hs.96151 Human ONA sequence from clone RP5-1 103G7 3.7 

303289 AL121460 H&272673 hypothetical protein FU20508 4.1 

5 303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypotheti 4.2 

303540 AA355607 Hs.309490 ESTs, Weakly similar to putative WHSC1 p 4.3 

303563 AA367699 Hs.1 0082 potassJum intermediate/small conductance 3.3 

303642 AW299459 gfcxs5(M08 jc1 NCLCGAPJ<id1 1 Homo saplen 4.2 

303780 AI424014 Hs.18995 K1AA1304 protein 3.6 

10 303797 AW629759 gb:hh70e05.y1 NCLCGAPJSU1 Homo sapiens 4.9 

303852 R53434 Hs.90207 hypoffietica! protein MGC1 1 138 3.7 

304328 AA149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCLCGAP_Gas1 Homo sapiens 4.1 

305913 AA876109 gb:nx24h0U1 NCLCGAPJ3C3 Homo sapiens 3 

15 305917 AA876469 gb:oe48b04.s1 NCI_CGAP_Pr25 Homo sapiens 3.1 

307010 AI140014 gb:qa68fl)9jc1 Soares_fetaLhearLNbHH19W . 3.5 

307041 AI144243 gb;qb85b12.x1 Soares_fetaLhearLNbHH19W 3.9 

306106 AM76803 gb^77e12jc1 Sc*nsJJSFJ : 8_9WJJTJ>AJ>_S 4.3 

308307 A1581398 Hs.172928 collagen, type I, alpha 1 4.6 

20 308615 AK000142 Hs.101774 hypofoetical protein FU23045 4.4 

309177 AI951118 Hs.326736 Homo sapiens breast cancer ant^enNY-BR 17.3 

309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 3.2 

309574 AW168083 gb^g59g04j(1 NCLCGAP_Ut4 Homo sapiens 3.1 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-8R 57.6 

25 310064 AI199712 Hs.1 48486 ESTs, Weakly similar to 1917210A Pro/Arg 4.6 

310098 AI685841 Hs.161354 ESTs 3.6 

310438 AW022192 Hs.200197 ESTs 4.6 

310683 AI939456 Hs. 160870 ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for K1AA1551 protein, 3.6 

30 310781 AI380797 Hs.158992 ESTs 10.2 

310895 AI955121 Hs.165724 N^cetylgalactD^ne^^urfotransfera 3.4 

310955 AI476732 Hs.263912 ESTs 103 

311117 AI671439 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 3.1 

311166 AI821005 Hs,118599 ESTs 10.8 

35 311237 AA641098 Hs.208809 ESTs, Moderately similar to ALU1 JSUMAN A 4.3 

311465 AI758660 Hs.206132 ESTs 4.4 

311587 A1828254 Hs.271019 ESTs, Weakly similar to A47582 8-ceH gr 5.1 

311598 AW023595 Hs.232048 ESTs 5.8 

311774 AA700870 Hs.14304 ESTs 3.3 

40 311785 AI056769 Hs.133512 ESTs 3,9 

311872 R12375 Hs.194600 ESTs 3.3 

311889 AA767342 Hs.122483 ESTs, Weakly similar to PSFJiUMAN PTB-AS 3 

311913 A1358522 Hs.270188 ESTs 3 

311923 T60843 Hs.189679 ESTs 5.6 

45 311935 AA216387 gb:nc16b02^1 NCLCGAPJ'rl Homo sapiens 5.2 

312019 AA373630 Hs.188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs.14411 ESTs 3.5 

312090 T80177 Hs.1 18064 sim8ar to rat nuclear ubiquitous casein - 3.8 

50 312147 AI633744 Hs.195648 ESTs, Weakly similar to 138022 hypotheti 4.4 

312153 BE261944 Hs.1 18625 hexokinasel 5.2 

312168 T92251 Hs.198882 ESTs 3.3 

312182 T94344 Hs.326263 ESTs 3.3 

312187 AA700439 Hs.188490 ESTs 3.4 

55 312199 AW438602 Hs.191179 ESTs 3.9 

312219 H73505 Hs.117874 ESTs 4 

312226 AA315703 Hs.199993 ESTs, Weakly similar to ALU8_HUMAN 111) 4.9 

312299 AA972712 Hs.269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 6.3 

60 312638 AW439195 Hs.256880 ESTs. Weakly similar to S65657 alpha-1C- 4.9 

312826 AW291545 Hs.185018 ESTs 4.9 

312837 AW292286 Hs.255058 ESTs 4.4 

312980 AA497043 Hs.1 15685 ESTs 3.1 

313070 AJ422023 Hs.161338 ESTs 4.3 

65 313079 N76497 Hs.1787 proteoilpid protein 1 {Pefcaeus-Merzbac 3.3 

313089 AF026944 Hs.293797 ESTs 5.8 

313096 AW073310 Hs.163533 Homo sapiens cONA FU14142 fis, ctone MA 4.5 
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313126 AA746503 Hs.283313 ESTs 10 

313166 A18Q1098 Hs.151500 ESTs 3.5 

313197 AW979008 Hs.222487 ESTs 3.3 

313280 AW960454 Hs.222830 ESTs 4.7 

5 313325 AI420611 Hs.127832 ESTs 3.4 

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4 

313352 AW150945 Hs.144758 ESTs 4.1 

313385 AI032087 Hs.269819 ESTs 3 

313393 AI674685 Hs.200141 ESTs 5.2 

10 313417 AA741151 Hs.137323 ESTs 33 

313434 W92070 gb:zh48gQ5.r1 SoaresJetaUrvecspleen, 3 J 

313569 AI273419 Hs.135 146 hypothetical protein FU13984 .3 

313591 AA046309 gtaf12f01.s1 Soares_fetaUiearLNbKH19W 5.6 

313615 AI540978 Hs.301997 hypothetical protein FU13033 3.2 

15 313915 C18863 Hs.163443 Homo sapiens cDNA FU1 1576 lis, done HE 263 

313975 AW175896 Hs.65114 keratin 18 3 

313979 A1535895 Hs.221024 ESTs 4.9 

313997 AV657317 Hs.288649 hypothetical protein MGC3077 3.9 

314043 AA827082 Hs.291872 ESTs 3.1 

20 314078 AW129357 Hs.329700 ESTs 8.3 

314097 AA648744 Hs.269493 ESTs 6.6 

314121 Ai732083 Hs.187619 ESTs 6.2 

314129 AA228366 Hs.115122 ESTs 4 
314138 AA740616 gb:ny97f11,s1 NCLCGAPJ3CB1 Homo sapiens 5.9 

25 314236 AA743396 Hs.189023 ESTs 3.1 

314244 AL036450 Hs.103238 ESTs 4 

314305 A12801 12 Hs.125232 Homo sapiens cDNA FU13266 fa, clone OV 8 

314306 A1697901 Hs.192425 ESTs 3.7 
314322 AA907153 Hs.190060 ESTs 3.3 

30 314394 AW961597 Hs.1 30816 ESTs, Moderately stater to 138022 hypoi A2 

314401 AI660412 Hs.234557 ESTs 3.3 

314465 AA602917 Hs.156974 ESTs 4.7 

314506 AA833655 Hs.206866 Homo sapiens cONA RJ 14056 fls, done HE 8.5 

314510 A1204418 Hs.190080 ESTs 4 

35 314546 AW007211 Hs.16131 hypothetical protein FU12876 3.4 

314547 AA399272 Hs.144341 ESTs 6.7 

314558 AI873274 Hs.190721 ESTs 27.4 

314627 AA425310 Hs.155766 ESTs, Weakly similar to A47582 B-ceO gr 4.4 
314648 AW979268 gb:EST39 1378 MAGE resequences, MAGP Homo 4.6 

40 314691 AW207206 Hs.136319 ESTs 20.7 

314729 AA457367 Hs.191638 ESTs 3.6 

314754 AW026761 Hs.134374 ESTs 3.6 

314814 BE350122 Hs.157367 ESTs, Weakly similar to 178885 serine/th 4.9 

314864 AW971198 Hs.294068 ESTs 4.3 

45 314881 AI095087 Hs.1 52299 ESTs. Moderately similar to S65657 alpha 3.7 

314882 AA828032 Hs.169076 ESTs 3.1 

314981 AW972359 Hs.293334 ESTs 3 

315006 A1538613 Hs.298241 Transmembrane protease, serine 3 10.9 

315021 AA533447 Hs,312989 ESTs 53 

50 315051 AW292425 Hs.163484 ESTs 123 

315060 AA551104 Hs.1 89048 ESTs. Moderately similar to ALUCJWMANI 5.8 

315073 AW452948 Hs.257631 ESTs 4.2 

315080 AA744550 Hs.136345 ESTs 3.7 

315175 AI025842 Hs.152530 ESTs 6 

55 315183 AW136134 Hs.220277 ESTs 3.9 

315193 AI241331 Hs.131765 ESTs, Moderately simflar to I38937 ONA/R 4.4 

315196 AI367347 Hs.44898 Homo sapiens done TCCCTA00151 mRNA sequ 8.2 

315198 A1741506 Hs.1 65900 ESTs. Weakly similar to ALU 1J1UMAN ALUS 3.6 

315240 R38772 Hs.172619 myelin transcription factor 1-like 3.4 

60 315263 AW510994 Hs.220740 ESTs 3.4 

315282 AI222165 Hs.144923 ESTs 4.9 

315296 AA876905 Hs.125286 ESTs 4 

315368 AB037745 Hs.104696 K1AA1324 protein 4.7 

315397 AA218940 Hs. 1 37516 fidgetin-like 1 3.1 

65 315489 AI378817 Hs.191847 ESTs 3.1 

315496 AA628539 Hs.116252 ESTs, Moderately similar to ALU1.HUMAN A 3.2 

315526 AI193043 Hs.128685 ESTs, Weakly similar to T17226 hypotheti 4.1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



315530 AW015415 


Hs.127780 ESTs 


8.9 


315562 AA737415 


Hs.152826 ESTs 


5.5 


315634 AAS370S5 


Hs^20585 ESTs 


6.3 


315647 AA648983 


Hs^12911 ESTs 


3,6 


315707 AJ418055 


Hs.161160 ESTs 


5.1 


315772 AW515373 


Hs.271249 Homo sapiens cDNA FU13580 fis, clone PL 


3.1 


315850 AW270550 


Hs.1 16957 ESTs 


3.8 


315858 AA737345 


HsJ94041 ESTs 


5 


315878 AA683336 


Hs.189046 ESTs 


3.1 


315977 AW865916 


Hs.151206 ESTs 


47 


315978 AA830893 


Hs.119769 ESTs 


4.1 


315995 AI217477 


Hs.194591 ESTs 


4.1 


316012 AA764950 


Hs, 11 9898 ESTs 


7 


316042 AI469960 


Hs.170698 ESTs 


4.9 


316052 AI962796 


Hs.136754 ESTs 


4.1 


316072 AW517524 


Hs.135201 N0D2 protein 


3.2 


316074 AW975114 


Hs.293273 ESTs 


3.8 


316100 AW203986 


Hs.213003 ESTs 


3.2 


316133 AI187742 


Hs.125562 ESTs 


3.7 


316177 A1904982 


Hs.293102 ESTs. Moderately similar to ALU1JHUMAN A 


30.7 


316186 A1433540 


gb:li69g05jc1 NCLCGAP_KJd11 Homosapien 


3.1 


316244 A1640761 


Hs.224988 ESTs 


3.5 


316303 AA740994 


Hs.209609 ESTs 


3.8 


316313 AA74130O 


Hs.202599 ESTs. Weakly sirnilar to 138022 hypothefi 


4.4 


316364 AA747807 


Hs.149500 ESTs 


3.2 


316580 AA938198 


Hs.146123 poly(A) polymerase gamma 


9.4 


316697 AW293174 


Hs.252627 ESTs 


4.4 


316715 AI440266 


Hs.170673 ESTs, Weakly similar to T24832 hypolhetl 


3 


316868 AI660898 


Hs.195602 ESTs 


3.2 


316869 AI954880 


Hs.134604 ESTs 


3.2 


316886 AA836331 


Hs.134981 ESTs 


4-4 


316897 AA838114 


Hs.221612 ESTs 


3.7 


316943 AW014875 


Hs.137007 ESTs 


4.6 


317069 A1732892 


Hs.190489 ESTs 


5.9 


317194 AW445167 


Hs.1 26036 ESTs 


4.1 


317360 AI125252 


Hs.126419 ESTs 


3.5 


317404 AI806867 


Hs.126594 ESTs 


5.1 


317452 AA972965 


Hs.135568 ESTs 


6.9 


317501 AI822034 


Hs,137097 ESTs 


4.6 


317674 AW294909 


Hs.132208 ESTs 


4,3 


317803 AW664964 


Hs.128899 ESTs 


6.1 


317834 X56348 


Hs.287270 ret proto-oncogene (multiple endocrine n 


3.1 


317850 AI681545 


Hs. 152982 hypothetical protein FU131 1 7 


3,4 


317881 AI827248 


Hs.224398 Homo sapiens cDNA RJ1 1469 fis, clone HE 


9.6 


317902 AW102941 


Hs.211265 ESTs 


4.1 


317916 A1565071 


Hs.159983 ESTs 


10.3 


318042 AW294522 


Hs.149991 ESTs 


3.1 


318223 AI077540 


Hs.134090 ESTs 


3.9 


318327 AW294013 


Hs.200942 ESTs 


3 


318332 AI093930 


Hs.163440 Homo sapiens cDNA: FU21000 fo. done C 


4.4 


318418 AF1 07493 


Hs.1 18498 Homo sapiens LUCA-15 protein mRNA. spOc 


5.4 


318558 AW402677 


Hs.146381 RNA binding motif protein, X chromosome 


4.4 


318625 AA526235 


Hs.193162 Homo sapiens cDNA FU1 1983 fis, clone HE 


5.9 


318634 T49598 


Hs.156832 ESTs 


4 


318740 NM002543 


Hs.77729 oxidised bw density lipoprotein (lectin 


7.3 


318744 AI793124 


Hs.144479 ESTs 


17.8 


318781 F11802 


Hs.6818 ESTs 


3 


319191 NM.012391 


Hs.79414 prostate epithelium-specific Ets transcr 


3.6 


319478 AI524124 


Hs.270307 ESTs 


4.6 


319510 W88532 


Hs.254562 ESTs 


3.3 


319551 AA761668 


gb:nz24c08.s1 NCLCGAPJBC81 Homo sapiens 


3.2 


319745 T79366 


Hs.108258 actfn binding protein; macrophln (microf 


3.3 


319834 AA071267 


gb:zmS1g01.r1 Stratagene fibroblast (937 


6.2 


319840 C19035 


Hs.164259 ESTs 


3.3 


319977 AA534222 


gb:nj21d02.s1 NCLCGAPAA1 Homo sapiens 


4.3 


320074 AA321166 


Hs.278233 ESTs 


3.4 


320167 AA984373 


Hs.90790 Homo sapiens cDNA: FL/22930 fis. done K 


4.1 
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320187 T99949 Hs.303428 HomosapienscDNAFU14832fc, clone OV 5.3 

320211 AUJ39402 Hs.125783 DEME-6 protein 9.2 

320416 A1026984 Hs.293662 ESTs 3.1 

320588 U78082 Hs.167738 RNA polymerase II transcripfiona! regula 3. 1 

5 320635 N50617 . Hs.80506 small nuclear ribonucteoprotein potypept 6.1 

320654 AI160015 Hs.118112 ESTs 3.5 

320742 AI601188 Hs.120910 ESTs 3 

320832 AA214584 Hs.290167 ESTs 3.7 

320915 AI359144 Hs.143688 Homo sapiens cONA: FU23031 As, done L 3.1 

10 321016 BE144167 Hs.49994 hypofoetical protein similar to RNA-Wnd 3.3 

321107 AI732643 Hs.144151 ESTs 12.3 

321171 AI769410 Hs.221461 ESTs 3.3 

321253 AA610649 Hs.333239 ESTs 3 

321318 AB033041 Hs.137507 vang (van gogh, DrosophilaHike 2 . 3.9 

15 321642 A1432199 Hs.247084 ESTs 3 

321644 AW975944 Hs.237396 ESTs 11.7 

321683 AI471598 Hs.197531 ESTs 3.8 

321756 029112 Hs.196151 ESTs 4.4 

321811 D80630 gb:HUM091D02B Human fetal brain (TFuJlwa 3.2 

20 321828 R59890 Hs,83623 nuclear receptor subfamily 1, group I, m 3.1 

321910 H67065 Hs.271530 ESTs, Weakly simi/ar to ALU7_HUMAN ALU S 4.7 

321937 AL049351 Hs.302058 Homo sapiens mRNA; cONA DKFZp566C093 (fr 3.5 

321978 N77342 Hs.21851 Homo sapiens cDNA FU12900 fis, done NT 5 

322035 AL137517 Hs.334473 hypothetical protein DKFZp56401278 19 

25 322136 AF075083 gb^omo sapiens fulilength insert cONA 3.6 

322258 BE265745 Hs.194359 ESTs, Weakly similar to ALUCJtUMAN llll 3 

322296 W76326 gb2d6(W04.r1 St»resJetaLhearLNbHH19W 4.4 

322303 AI357412 Hs.157601 ESTs 115 

322476 AW963372 Hs.46677 PRO2Q00 protein 3 

30 322520 T5595S gb:yb35ft)5.r1 Straiagene fetal spteen (9 3 

322521 AF147347 gb:Homo sapiens full length insert cDNA 4.2 

322567 AF155108 Hs.256150 Homo sapiens, Similar to RIKEN cONA 2810 4 

322595 W92147 Hs.1 18394 ESTs 5.4 

322675 AA017656 gb:ze39h01.r1 Soares retina N2MHR Homo 3.1 

35 322766 AW068805 Hs.288467 Homo sapiens cDNA HJ12280 fis, done MA 5.2 

322818 AW043782 Hs.293616 ESTs 7.6 

322882 AW248508 Hs.279727 Homo sapiens cONA FU14035 fis, done HE 5.9 

322975 C16391 gb:C16391 Clontech human aorta polyA mRN 165 

323091 AJ902456 Hs.210761 ESTs, Weakly similar to 138022 hypothetl 4 

40 323131 AK002088 Hs.270124 Homo sapiens cDNA FU11226 fis, clone PL 3.3 

323168 AL120862 Hs.124165 programmed cefl death 9 (PDCD9) 6.3 

323244 AW675572 Hs.193820 ESTs 4.6 

323262 AL133990 Hs.190642 ESTs 10.5 

323332 AI829520 gb:wt19c06.x1 NCLCGAP.Utt Homo sapiens 6.2 

45 323333 AV651680 Hs,208558 ESTs 4.3 

323335 AI655499 Hs.161712 ESTs 9.2 

323645 AW445014 Hs.197746 ESTs 3.1 

323663 BE081058 Hs.243023 ESTs 4 

323693 AA317962 Hs.249721 ESTs, Moderately similar to PC4259 ferri 3 

50 323782 AW961560 Hs.97600 ESTs Z2 

323817 AA410943 BMP-R1B 8.4 

323930 AL043683 Hs.8173 hypothetical protein FU10803 3.3 

323974 A1825204 Hs^11408 ESTs 4.5 

324001 AL044949 Hs.1 16298 ESTs 4.5 

55 324036 AI472078 Hs.303662 ESTs 8.4 

324261 8E069341 gb:QV3-BT0381-270100-073-c08BTQ381 Homo 49.4 

324285 AA431159 Hs.122954 ESTs 3 

324296 AI524039 Hs.192524 ESTs 3 

324305 AA642007 Hs.1 16369 ESTs 3.3 

60 324432 AA464510 Hs.152812 ESTs 16.5 

324585 AI823969 Hs.132678 ESTs 3.3 

324598 AW972227 Hs.163986 Homo sapiens cONA: FU22765 fis, clone K 5 

324603 AW993522 Hs.292934 ESTs 10.4 

324631 AA937116 Hs.293683 ESTs, Weakly similar to 154374 gene NF2 3.3 

65 324716 BE169746 Hs.12504 likely orthoiog of mouse Arkadia 3.2 

324748 AW974941 Hs.292385 ESTs, Weakly similar to 178885 serine/th 3 

324771 AA631739 Hs.335440 EST 3 
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324774 AI031771 Hs.132586 ESTs 4.2 

324823 AW516704 Hs.208726 ESTs 3.4 

324824 AI826999 Hs.224624 ESTs 3.1 
324826 AA704806 Hs.143842 ESTs, WeaWy simBar to 2004399A chromes 4.4 

5 324961 AA613792 gfcno97h03.s1 NCI_CGAPJ>r2 Homo sapiens 3.9 

324987 AI375572 Hs.172634 ESTs 18.8 

324994 AI805416 Hs.213897 ESTs 3.3 

325146 AI064690 Hs.171176 ESTs 4.2 

325372 Phase2&3Exons 4.4 

10 325544 . Phase2&3Exons 5.7 

327075 Phase 2&3Exons 3.8 

332798 C22000007^qi2314195|emb|CAB99338.1|(A 4.3 

334223 NM_00508(r:Homo sapiens X-box binding pr 26.2 

334447 NM.012429*:Homo sapiens SEC14 (S. cerevl 3.9 

15 335809 NMJ)14509*:Homo sapiens kraken-iike (BK1 10.1 

335824 ENSP00000249072*:OJ222E13.1 (N-TERMlNAL 20 

338255 NIVL014323*:Homosaplens2incfingerpn3t 9 

409430 R21945 Hs.166975 splicing factor, anginmeTserine-dch 5 4 

426046 AW812795 Hs.1 55381 ESTs, Moderately similar to I38022 hypot 4.6 

20 432558 R97268 Hs.177269 ESTs 3.2 

436808 AA731602 Hs.120266 ESTs 3.9 

448569 BE382657 Hs.21486 signal transducer and activator of trans 4.1 

453542 AW836724 Hs.339660 Homo sapiens mRNA expressed only b plac 3.7 

M97935 AFFX control: STAT1 3.2 

25 M97935 AFFXcontrot STAT1 3 

M55150 fumarylacetoacetata 3 

M13755 interferon stimulated protein; 15 kDa 4.5 

AI052Q47 ESTs 6.7 

AA252033 ESTs; Weakly slmfer to llll ALU SUBFAMILY J 3.2 

30 AA401739 ESTs 3.3 

H 1 8459 hepatocellular carcinoma associated protein; 3 

R48744 ESTs 4.2 

M31682 inhioin; beta B(activinAB beta polypeptide) 3 

AA416873 ESTs 3 

35 D80240 HUM5G11A Human fetal brain (TFujwara) Homo 4 

R49590 ESTs 3.2 

CH22_FGENES.678_5 16.8 

CH22.FGENES.619.7 12.9 

CH22.FGENES.619J2 11.3 

40 Cr^EMAC005500.GENSCAN.1279 9.2 

CH2^EM^C005500.GENSCAN.3042 8.5 

CH2*_FGENES.271J 8.4 

CH22.FGENES.619J3 6 

CH22.FGENES.27L7 7.3 

45 CH22_FGENES,617_7 72 

CH.07 hsgl|6004473 7.1 

CH22^FGENES.264_1 6.8 
X03363 HER2 receptor tyrosine kinase {c erbB 2; ERBB2; neu) 6,6 

CH22_FGENES.617_9 6.5 

50 CH.07_hsgi|5868264 5.8 

CH.19_hsgi|5867439 5.7 

CH22.FGENES.63 5.3 

CK17Jwglj5867230 5.1 

CR20_hsgi!6552458 5.1 

55 CH22„ENtAC005500.GENSCAN.14822 4.7 

CH22_FGENES.669_10 4.6 

AA034918 KIAA1028 protein 4.6 

CH22JGENES.48J2 4.5 

CH22.FGENES.118J2 4.5 

60 AF049569 ESTs 4.4 

M13955 multiple UniGene matches 4.3 

CH22_FGENES.619_8 4.3 

CH22J : GENES.137 4.3 

HG4126 HT4396 Zinc Finger Protein Hzf4 4.3 

65 CH22_FGENES^60_3 4.3 

CH22_FGENES.706_9 4.3 

CH.21_hsgli6531965 4.2 
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CH.17_hsgl)5867215 
CH22J=GENES.669JB 

CH22.FGENES.48J8 

matrix metalloproteinase 14 (membrane Inserted) 
CH22J : GENESJ271_6 
CH22J : GENES.617_3 
CH2^FGENES^90_8 

CH22_FGENES.135 
CH22^FGENES.132 
CH.14_hsgi|S682474 
CR02_hsgi|5867750 
CH22J=GENES.617J 

CH22LDJ32t10.GENSCAN.23 39 

CH22_FGENES.543_20 

CH22_ErvtAC005500.GENSCAN.96 1 

CH22_FGENES.204J 

CH22_FGENES.619_4 

CH.16_hsgi|5867087 

EST cluster (not In UnlGene) 

CH2*_EM:AC005500.GENSCAN.149 9 

CH22_EM:AC005500.GENSCAN,421 5 

CH22_FGENES.134 

CH.07Jisgi|6004478 

CH22LFGENES.360J 

CH22_FGENES.6_2 

CH22_C20H12.GENSCAN.16 2 

CH2^_C65E1.GENSCAN.8 1 

ESTs; Weakly similar to cis Gotgi matrix 

CH22_FGENES.307_4 

CH22_EM:AC005500.GENSCAN.248 14 

CH.06_hsgi|5902482 

CH22_FGENES.669_5 

CH22_DJ32I10.GENSCAN.19 8 

CH22_FGENES.527_6 

CH22_FGENES.330_10 

CH22_FGENES.14 2 

ESTs 

CH22_FGENES.2267 
CH22J=GENES.133 
CH22_EMAC005500.GENSCAN.209 12 
CH22.FGENES.27U 



4.1 
4.1 

Collagen, Type Viii, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5' Monophosphate Synthase 3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene Ret/Ptc2, Fusion Activated 3.7 

3.7 

3.7 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

Dna Binding Protein Ap 2, Alt Spiice 3 3.3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 
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TABLE 17A 



10 



Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



15 



Ptey: Unique Eos probesel Identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey CAT number Accession 

116845 393481J AA649530 AA659316 H64973 
103207 30635_-4 X72790 

126257 182217J N99638 AW973750 AA328271 H90994 AA558020 AA234435 N 59599 R94815 

102791 37186.1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 $46404 U875S7 M463992 AW206802 A1970376 AI583718 AI672574 
N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 AK38G4BO AI631703 AI651023 A1867418 AW818140 AA502500 A1206199 AI671282 AI352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE46661 1 A1206344 AA574397 
AA348354 AI493192 

126872 142696J AW450979 AA1 36653 AA 1 36656 AW4 19381 AA984358 AA492073 BE 1 68945 AA809054 AW238038 BE011212 BE011359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
1746257 J R82040 R70934 
176835.1 AA225084AA302713 

324239J AJ31 1928 AA936030 T51931 AA609816 AA487195 AA664207 
genbanK_F09609 F09609 
genbanK_R08440 R08440 



112631 
120742 
106864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 
322296 
321811 
314648 
322520 
322521 
322675 
323332 
316186 
322975 
324261 
323817 
301976 
324961 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbank_W81598 W81598 

genbanLW84768 W84768 

genbank.N22401 N22401 

504187 J AA121022AA126422 

genbank_R54797 R54797 

46802J AF075083 H52291 H52528 

47334.1 W76326 AF086341 W72300 

1 52748 1_1 D80630 080896 D80895 

293660J AW979268 AA878419 AA431342 AA431628 

38916 1 T55958 T57205AF147346 

38917J AF147347 T55426 T55503 

86787.1 AA017656AA017374AA019761 

179142.1 AI829520 AI791832 AA228414 AI791823 AA229211 AA229315 
425440.1 AM33540AA728984AA8O4981 
1510563.1 C16391C16413 

273265.1 BE069341 AW748403 AL044891 AI9Q8240 AA393080 

233566.1 AA410943 AW948953 AA334202 AA332882 

128835.1 T97905AA101672 

376239.1 AA613792 AW182329 T05304 AW858385 

284260.1 AW299459 AA4171 12 

386364.1 AW629759 AW749955 AA633408 AI651005 

357371J AA761668 AA573621 R92814 R09670 

174129.1 AA216387 T63548 AA228676 

112523.1 AA071267 T65940 T64515 AA071334 

345248 1 AA534222 AA632632 T81234 

179960.1 AA740616 AA654854 AA229923 

103087.1 AA046309AJ263500AA046397 
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308106 AI476803 

338255 CH22J856FG_UNK.EMrtC0Q 

335809 CH22_3181FG_617_6JJNK_EM 

335824 CH22_3197FG_619 11 UNK.E 

307010 AI140014 

307041 AM44243 

305913 AA876109 

305917 AA876469 

309574 AW168083 

325372 c12Jis 

325544 c12_hs 

332798 CH22J4FGJ__5_UNK_C4G1.G 
334223 CH22L1507FG^360J_L!NK^EM 
327075 c21Jis 

334447 CH22_1746FG^387J_LINK_EM 
304782 AA582081 

313434 441798 J W92070 AW0 19952 W92053 
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TABLE 17B 



15 



20 



25 



Table 17B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 17. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Ptey: 

10 m 



Stand: 
NLpositfcn: 

Pkey Ref 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham I. et al.* refers to the publication 

enfitted "The DMA sequence of human chromosome 7Z' Dunham I. et a!. ( Nature (1999) 402:489-495. 
Indicates 0NA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



334447 Dunham, I. 

335809 Dunham,!. 

335824 Dunham, I. 

332798 Dunham, I 

334223 Dunham, I. 

338255 Dunham, I. 

325372 5866920 

325544 6682452 

327075 6531965 



Strand 

etal. Plus 
etai. Plus 
etal. Plus 
etal Minus 
etaL Minus 
etal. Minus 
Minus 
Plus 
Plus 



NCposftfon 

14308764-14308824 

26310772-26310909 

26376860-26376942 

232147-231974 

12734365-12734269 

15242294-15242231 

1117061-1117304 

171228-171286 

40413184041431 
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TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
5 expressed in normal body tissues. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, GenbanJc accession number 

10 UnigenelD: Unkjene number 

Unlgene Title: Ungene gene title 

R1: Ratio of tumor to norma] body tissue 



15 Pkey ExAccn UnigenelD Unlgene Title 



R1 



101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3 

101530 M29874 Hs.1360 cytochrome P450, subfamily liB (phenobar 9 

101767 MS1057 Hs. 180384 carboxypeptidase B1 (flssue) 12 

20 101878 M97815 Hs.183650 cellular retinoicacioM)lnding protein 2 6.5 

103010 X52509 Hs.161640 tyrosine aminotransferase 12.4 

104115 AF183810 Hs.26102 opposite strand to trichorhlnophaiangeal 7.6 

104825 AA035613 Hs.141883 ESTs 6.9 

107105 AW963419 Hs.155223 stannfocalcin 2 5.3 

25 108819 AA011449 Hs.271627 ESTs 6.1 

112287 AB033064 Hs.334806 KIAA1 238 protein 7.3 

112561 AI791493 Hs.129873 ESTs, Weakly similar to A36036 cytochrorn 8.2 

112637 R82331 Hs.164599 ESTs 5.4 

113206 BE262470 Hs.241471 RNB6 6.2 

30 113970 W27249 Hs.8109 hypothetical protein FU21080 6.9 

114965 A1733881 Hs.72472 BMP-R1B 10.1 

118925 N92293 Hs.206832 EST s, Moderately similar to ALU8 HUMAN A 19.7 

119905 AW449064 Hs.119571 collagen, type 111, alpha 1 (Ehlers-Danl 8.4 

121611 M31669 Hs.1735 inhibin, beta B (activin AB beta potypep 5.6 

35 129301 AF182277 Hs.330780 cytochrome P450. subfamily I1B (phenobar 6.2 

133976 AI908165 Hs.169946 GATA-binding protein 3 (T-ceii receptor 6.2 

134731 D89377 Hs.89404 nish(Drosophila}haTWoboxhornotog2 5.8 

300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9 

301884 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7 

40 302001 AB020711 Hs.278346 WAAG9Q4 protein 7.7 

302067 BE542706 Hs.222399 CEGP1 protein 7.3 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2 f 5.4 

302290 AA179949 Hs.175563 Homo sapiens mRNA; cONA DKFZp564N0763 (f 34.1 

302372 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 6.7 

45 302385 AJ224172 Hs.204096 Opophllin B (uteroglobin family member) 13.8 

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6 

310781 AI380797 Hs,158992 ESTs 10.2 

311166 AI821005 Hs.118599 ESTs 10.8 

50 311935 AA216387 gb:nc16bQ2.s1 NCLCGAP_Pr1 Homo sapiens 5.2 

312153 BE261944 Hs.118625 hexokinasel 5.2 

313328 AW44921 1 Hs.105445 GDNF family receptor alpha 1 12.4 

313915 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, done HE 26.3 

314097 AA648744 Hs.269493 ESTs 6.6 

55 314138 AA740616 gb:ny97f11.s1 NCLCGAPJ3CB1 Homosaptens 5.9 

314506 AA833655 Hs2QS868 Homo sapiens cDNA FU14056 fis. done HE 8.5 

314558 AJ873274 Hs.190721 ESTs 27.4 

314691 AW207206 Hs.136319 ESTs 20.7 

315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9 

60 315021 AA533447 Hs.312989 ESTs 5.3 

315051 AW292425 Hs.163484 ESTs 12.9 

315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUCJWMANI 5.8 
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315196 AI367347 Hs.44898 Homo sapiens done TCCCTA00151 mRNA sequ8.2 



315530 AW015415 Hs.127780 ESTs 8.9 

315634 AA837085 Hs.220585 ESTs 6.3 

316012 AA764950 Hs.119898 ESTs 7 

5 316177 AJ904982 Hs.293102 ESTs, Moderately similar to ALUI.HUMAN A 30.7 

316580 AA938198 Hs.146123 poty(A) polymerase gamma 9.4 

317803 AW6649S4 Hs.128899 ESTs 6.1 

317881 AI827248 Hs.224398 Homo sapiens cONA FU1 1469 lis, done HE 9.6 

318740 NM_002543H$.77729 oxidised low density lipoprotein (ledin 7.3 

10 318744 AI793124 Hs.144479 ESTs \7& 

320211 AL0394O2 Hs.125783 OEME-6 protein 9.2 

321107 AI732643 Hs.144151 ESTs 12.3 

321644. AW975944 Hs.237396 ESTs 11.7 

321978 N77342 Hs.21851 Homo sapiens cDNA FU12900 fis, done NT 5 

15 322035 AL137517 Hs.334473 hypothetical protein DKFZp56401278 19 

322766 AW068805 Hs-288467 Homo sapiens cDNA FU 12280 fis, done MA 5.2 

322818 AW043782 Hs.293616 ESTs 7.6 

322975 C16391 gb:C16391 Ctontech human aorta polyA mRN 16.5 

323262 AL133990 Hs.190642 ESTs 10.5 

20 323332 AI829520 gb:wI19c06.x1 NCLCGAPJitl Homo sapiens 6.2 

323817 AA410943 BMP-R1B 8.4 

324261 BE069341 ' gb:QV3^TO381-27010(H)73H*8BT0381 Homo 49.4 

324432 AA464510 Hs.152812 ESTs 163 

324598 AW972227 Hs.163986 Homo sapiens cDNA: RJ22765 fc, done K 5 

25 324603 AW993522 Hs.292934 ESTs 10.4 

324987 AI375572 Hs.172634 ESTs 18.8 

325544 Phase2&3Exons 5.7 

330388 Hs.46 HER2 receptor tyrosine kinase (c-erb-b2, 6.6 

334223 NWL005080*:Homo sapiens X-box binding pr 26.2 

30 335809 NNL014509*:Homo sapiens kraken-Cke (BK1 10.1 

335824 ENSP00000249072MW222E13.1 (N-TERMiNAL 20 

AI052047 ESTs; WeaWy similar to CYTOCHROME P450 6.7 

R72427 CH22_EMAC005500.GENSCAN.127 9 5.5 

CH22_FGENES.619_13 9.2 

35 CH22_FGENES.617J) 8 

CH22_FGENES.271_7 65 

CH22_FGENES.619_7 7.3 

CH22_FGENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

40 CH22_EMAC005500.GENSCAN.304 2 11.3 

CH.07Ju>gl|6004473 8.5 

CH22_FGENES.617J 7.1 

CH22J r GENE$.678_5 12 

CH22_FGENE$.678_5 16.8 



280 



WO 02/059377 



PCT/US02/02242 



10 



15 



TABLE 18A 

Table 18A shows the accession numbers for those pkeys lacking unigenelD's for Table 18. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 



323332 179142 J A1829520AI791832AA228414 AI791823 AA229211 AA229315 
20 322975 1510563J C16391 C16413 

324261 273265 J BE069341 AW748403 AL044891 AI908240 AA393080 

323817 233566J AA410943 AW948953 AA334202 AA332B82 

311935 174129J AA216387 T63548AA228076 

314138 179960J AA740616 M654854 AA229923 
25 335809 CH22J181FG.617JJJNK.EM 

335824 CH22J197FGJ19J1JJNK £ 

325544 c12Jis 

334223 CH22J507FGJ60_A_UNHLEM 
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 18. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

1 0 Pkey: Unique number corresponding to an Eos probeset 

Rer. Sequence source. The 7 digit numbers In this column are Genbank Identifier (GI) numbers. *Dunham I. et al." refers to the publication 

entitled "The ONA sequence of human chromosome 22." Dunham I. et al.. Nature (1999) 402:489-495. 
Strand: Indicates ONA strand from which exons were predicted. 

NLposftion: Indicates nucleotide positions of predicted exons. 



15 



Pkey Ref Strand Ntposftion 



335809 Dunham, I. eta!. Pius 26310772-26310909 

20 335824 Dunham, I. etal Plus 26376860-26376942 

334223 Dunham. I. etal Minus 12734365-12734269 

325544 6682452 Plus 171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 

5 Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Afiymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90 th percentile value. The 
"average" normal adult tissue level was set to the 90 th percentile value amongst 144 non- 
10 malignant tissues. In order to remove gene-specific background levels of non-specific 

hybridization, the 15 th percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 



1 5 Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 



Pkey ExAccn UnigenelD UnigeneTKIe R1 

408591 AF015224 Hs.46452 mammaglobinl 137.6 

25 406364 M21305 gb:Human alpha satellite and sateKite 3 71.0 

400291 AA401369 Hs.190721 ESTs 68.4 

407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 54.2 

449746 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CHOC 46.4 
426878 BE069341 gb:QV3^T0381-27010(W)73^08 BT0381 Homo 44.8 

30 400292 AA250737 Hs.72472 BMP-R1B 37.4 

427585 D31 152 Hs.179729 collagen, type X, alpha 1 (SchmkJ metaph 32.9 

408045 AW138959 Hs.245123 ESTs 31.9 

407178 AA195651 Hs.104106 ESTs 30.4 

407377 C16391 gb:C16391 Clontech human aorta potyA mRN 27,7 

35 450705 U90304 Hs.25351 Iroquois homeobox protein 5 24.8 

407212 AA412108 Hs.269350 ESTs 22.0 

428848 NH.000230 Hs.194236 teptin (murine obesny homolog) 21.9 

404561 trlchominophalangeal syndrome 1 (TRPS1) 21 .8 

407980 AA046309 gb:zf12f01.s1 Soares_fetaLhearLNbHH19W 19.8 

40 447350 AI375572 Hs.172634 ESTs 17.3 

450375 AA009647 Hs.8850 a dlsintegrin and metafloproteinase coma 16.6 

422109 S73265 Hs.1473 gastrin^eleasing peptide 16.5 

435496 AW840171 Hs.265398 ESTs, Weakly similar to transfofmatlon-r 16.0 

453160 AI263307 Hs.239884 H2B hlstone family, member L 15.8 

45 420813 X51501 Hs.99949 prolactin-induced protein 15.8 

415989 AI267700 Hs.317584 ESTs 15.5 

422505 AU20862 Hs.124165 programmed cell death 9 (PDC09) 14.8 

424399 A1905687 Hs.2533 aldehyde dehydrogenase 9 family, member 14.5 

423575 C18863 Hs.163443 Homo sapiens cONAFUl 1576 fis, done HE 13.7 

50 429441 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.6 

431474 AL133990 Hs.190642 ESTs 13,5 

448595 AB014544 Hs.21572 WAA0644 gene product 13.0 

427217 AA399272 Hs.144341 ESTs 12.8 

402578 C10O1134:gq2117372|pirl|l65981 fatty ac 12.6 

55 422805 AA436989 Hs.121 017 H2A hlstone family, member A 1£2 

424634 NM_003613 Hs.151407 cartilage Intermediate layer protein, nu 12.0 

456207 AA193450 gfczr40e07.r1 SoaresJihHMPu_S1 Homosapi 11.9 

424086 A1351010 Hs.102267 tysyl oxidase 11.9 

459587 AA031956 gb:zk15e04.s1 SoaiesjjregnanLutenJsJfoH 115 
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416208 AW291168 Hs.41295 ESTs. WeaWysimHartoMUC2JftJMANMUC[N 11^ 



429170 NMJJ01394 Hs.2359 dual spedfjcily phosphatase 4 . 11.5 

407276 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-fiR 11.4 

434377 AW137148 Hs.306593 Homo sapiens cONA FU11382 fis, clone HE 11.3 

5 448390 AL035414 Hs.21068 hypofhefical protein 11.1 

452401 NM.007115 Hs.29352 tumor necrosis factor, alpha-induced pro 11.0 

421037 AI684808 Hs.197653 programmed cell deaft 9 (POCD9) 103 

452461 N78223 Hs.108106 transcripfion factor 10.7 

443348 AW873596 Hs.182278 calmodulin 2 (phosphorylasa kinase, dett 10.6 

10 421155 H87879 Hs.102267 tysyt oxidase 10.5 

402606 NM_024626.-Homo sapiens hypothetical prot 10.4 

409269 AA576953 Hs.22972 rrypofoeGcal protein FU13352 10.4 

447268 AI370413 Hs.36563 hypothetical protein RJ22418 103 

447033 A1357412 Hs.157601 ESTs 10.2 

15 400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NMP02497 Hs.153704 NIMA (never in mitosis gene a)-retated k 10.1 

432441 AW292425 Hs.163484 ESTs 9.9 

427365 AI873274 Hs.190721 ESTs 9.9 

438950 H23789 Hs.144530 EST 9.8 

20 422835 BE218705 Hs.121378 metafothtonein-Gke 5. testls-specmc 9.7 

425692 090041 Hs.155956 N-acetyitransferase 1 (arylamine N-acety 97 

411869 W20027 Hs.23439 ESTs 9.6 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cON 9.6 

446730 AI624342 Hs.170042 ESTs 9.5 

25 459583 A1907673 gb:lt^T152^)80399-004BT152Homosapien 9.3 

452744 AJ267652 Hs.30504 Homo sapiens mRNA; cONA DKFZp434E082 (fr 9.2 

432596 AJ224741 Hs.278461 matrffin3 9.1 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401 278 9.1 

449448 D6O730 Hs.57471 ESTs 9.1 

30 423945 AA410943 gb:zt32h03 /I Soares ovary tumor NbHOT H 9.1 

406348 Target Exon 9.0 

424735 U31B75 Hs.272499 short-chain alcohol dehydrogenase family 9.0 

453392 U23752 Hs.32964 SRY {sex determining region Y}box 1 1 9.0 

433365 AF026944 Hs.293797 ESTs 8.8 

35 405654 NA C12001521:gl|7513934|pir|lT31Q81 cca3 pr 8.8 

418601 AA279490 Hs.86368 calmegin 8.8 

451110 A1955040 Hs.265398 ESTs, Weakly similar to transformation-r 8.7 

408771 AW732673 Hs.47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs.50081 KIAA1 199 protein 8.4 

40 442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 8.3 

451561 N52812 Hs.177403 ESTs 8.2 

424001 W67883 Hs.137476 paternally expressed 10 8.2 

429859 NM.007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.1 

45 405095 NA Target Exon 8.1 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 8.0 

413472 BE242870 Hs.75379 solute carrier famliy 1 (glial high am* 8.0 

416747 AW876523 Hs.15929 hypothetical protein RJ 12910 . 8.0 

50 415385 R17798 Hs7535 COBW-like protein 7.9 

434424 AI811202 Hs.325335 Homo sapiens cONA: RJ23523 fis, done L 7.9 

42)931 AF044197 Hs.100431 smaQ Inducible cytokine B subfamily (Cy 7.9 

406687 M31 126 Hs.272620 matrix metatloproteinase 1 1 (MMP11; stro 7.8 

400285 NA Eos Control 7.7 

55 437207 T27503 Hs.15929 hypothetical protein FU12910 7.6 

427119 AW880562 Hs.1 14574 ESTs 7.5 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

411078 A1222020 Hs.182364 CocoaCrisp 7.4 

60 426214 H59846 Hs.1 28355 ESTs, Moderately similar to AUJ7JflJMAN A 7.4 

447475 AJ380797 Hs.158992 ESTs 7.3 

415263 AA948033 Hs.130853 ESTs 72 

439569 AW602166 Hs.222399 CEGP1 protein 72 

414142 AW368397 Hs.1 50042 Homo sapiens cONA FU14438 fis, clone HE 7.1 

65 426261 AW242243 Hs.168670 peroxisomal famesylated protein 7.0 

444783 AK001468 Hs.62180 anfifin (Drosophiia Scraps homotog), act 6.9 

445885 AI734009 Hs.127699 K1AA1603 protein 6.9 
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429432 A1678059 Hs.202676 synaptonemal complex prolan 2 6.9 

410781 AI375672 Hs.165028 ESTs 6.9 

443788 AI732643 Hs.144151 ESTs 6.9 

421373 AA808229 Hs.167771 ESTs 6.8 

5 451398 AI793124 Hs.144479 ESTs 6.8 
404253 NMj021058 4 :Homo sapiens H2B hlstone fami 6.8 

441098 AI015591 Hs.131004 ESTs, Weakly similar to T17227 hypotfiefl 6.7 

426215 AW963419 Hs.155223 stanniocalcin 2 6.6 

428227 AA321649 Hs.2248 small Inducible cytokine subfamily B (CX 6.6 

10 422956 BE545072 Hs.122579 hypothetical protein RJ10461 6.6 
411111 AW818127 gb:CM1-ST0277^)61299-0594307ST0277 Homo 6.6 

434988 AI418055 Hs.161160 ESTs 6.6 

442580 AI733682 Hs.130239 ESTs 6.6 

449611 AI970394 Hs.197075 ESTs 6.6 

15 408000 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 6.5 

420757 X78592 Hs.99915 androgen receptor (dfliydrotestosterone r 6.5 

431089 BE041395 Hs.283676 ESTs, WeaWy similar to unknown protein 6.5 

400301 X03635 Hs.1657 estrogen receptor 1 6.5 

427356 AW023482 Hs.97849 ESTs 6.5 

20 425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.4 

441134 W29092 H$J67B cellular retinolcadd-binamg protein 1 6.4 

424902 NM.003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4 

448693 AW004854 Hs.228320 hypothetical protein RJ23537 6.4 

431448 AL137517 Hs.334473 hypothetical protein DKFZp56401278 65 

25 444342 NUL014398 Hs. 10887 similar to lysosome-associated membrane 6.1 

422168 AA586894 Hs.112408 S100<^durrhWndingpfotelnA7{psorias 6.1 

453331 AI240665 Hs.8895 ESTs 6.1 

418007 M13509 Hs.83169 matrix metalfoprotekase 1 (MMP1; Inters 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

30 416092 R45154 Hs.106604 ESTs 6.0 

430044 AA464510 Hs.152812 ESTs 5.9 

432837 AA310693 Hs.87329 HSPC072 protein 5.9 

433285 AW975944 Hs.237396 ESTs 5.9 

450701 H39960 Hs.288467 Homo sapiens cDNARJ12280fis, done MA 5,9 

35 425707 AF115402 Hs.11713 E7«e factor 5 (ets domain transcript 5.9 
410785 AW803341 gb:IL24)M0079^9W()04)5(W)03U^79Horre)55 

425398 AL049689 Hs.156369 hypotheMprotefosMlartoterciSOT 5.9 

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8 

459371 R20991 gb:yg06h01rl Soares infant brain 1NIB H 5.8 

40 411284 N28519 Hs.135191 ESTs, Weakly simitar to unnamed protein 5.8 

453511 AL031224 Hs.33102 transcription factor AP-2 beta (acfivati 5.8 

451807 W52854 Hs.27099 hypothetical protein FU23293 similar to 5.7 

430510 AW162916 Hs.241576 hypothetical protein PR02577 5.7 

415539 AI733881 Hs.72472 BMP-RIB 5.6 

45 438199 AW016531 Hs.122147 ESTs 5.6 

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 5.5 

430019 AA463893 Hs.220933 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein RJ23045 5.5 

423811 AW299598 Hs.50895 homeoboxC4 5.4 

50 434539 AW748078 Hs.214410 ESTs, Weakly similar to MUCZ-HUMAN MUCIN 5.4 

439138 AI742605 Hs.193696 ESTs 5.4 

453931 AL121278 Hs.25144 ESTs 5.4 

444078 BE246919 Hs.10290 U5 snRNP^pecific 40 kDa protein (hPrp8- 5.4 

447102 BE167434 Hs,98471 ESTs, WeaWy similar to T18712 hypotheti 5.4 

55 451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 5.4 

425236 AW067800 Hs.155223 stanniocalcin 2 5.3 

421464 AA291553 Hs.190086 ESTs 5.3 
450736 AW970060 gb:EST352140 MAGE resequences, MAGK Homo5.3 

428085 AA421081 Hs.12388 ESTs 5.3 

60 452838 U65011 Hs,30743 preferentially expressed antigen In mefa 5.3 

445424 AB028945 Hs.12696 cortactin SH3 domain-binding protein 5.3 

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2 

438167 R28363 Hs.24286 ESTs 5.2 

65 433330 AW207084 Hs.132816 hypothetical protein MGC14801 5.2 

449765 N92293 Hs.206832 ESTs, Moderately similar to ALU8 JiUMAN A 52 

416276 O41060 Hs.79136 LIV-1 protein, estrogen regulated 5.2 
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400300 X03363 HER2 receptor tyrosine Wnase (o-erb-b2, 5.2 

418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GONF family receptor alpha 1 5.2 

428771 AB028992 Hs.193143 KIAA1069 protein 5.2 
5 455047 AW852530 gbPMW0243-071099401-g06 CT0243 Homo 5.2 

419169 AW851980 Hs.26234S ESTs, WeaWy siniilar to S72482rrypofoeti 5.2 

453197 AI916269 Hs.109057 ESTs, WeaWy similar to ALU5_HUMAN ALU S 5.1 

400298 AA032279 Hs.61S35 six transmembrane epithelial anffgen of 5.1 

431023 AI283133 Hs.297420 ESTs 5.1 

10 427666 A1791495 Hs.180142 calmodutin-Gke skin protski 5.1 

427718 A/798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs.1 16369 ESTs 5.1 

429220 AW207206 Hs.136319 ESTs 5.1 

405494 NA C2001837^gi|12697903]dbjjBAB21770.1|(A 5.1 

15 452930 AW195285 Hs.194097 ESTs, WeaWy similar to 138022 hypotheti 5.1 

444910 A1201849 gb:qs76g04j(1 NCLCGAPJ>r28 Homo sapiens 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0 

444381 8E387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 5.0 

450603 R43646 Hs.12422 ESTs 5.0 

20 416575 W02414 Hs.38383 ESTs 5.0 

438504 AW665281 Hs.224625 ESTs 5.0 

416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient yeast, h 5.0 

428804 AKD00713 Hs.193736 hypothetical protein FU20706 5.0 

420077 AW512260 Hs.87767 ESTs 4.9 

25 450480 X82125 Hs.25040 zinc finger protein 239 4.9 

437637 AJ003O29 Hs.65792 syntrophin. gamma 2 4.9 

431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 4.9 

418836 AI655499 Hs.161712 ESTs 4.8 

442441 A1820662 Hs.129598 ESTs 4.8 

30 435635 AF220050 Hs.181385 uncharacterized hematopoietic stem/proge 4.8 

400286 NA C16000922:glf7499103lpifilT20903 hypothe 4.8 

407506 U71600 gb:Human zinc finger protein zfp31 (zf31 4.8 

420026 AI831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

35 457726 AI217477 Hs.194591 ESTs 4.8 

412785 AW997556 Hs.78521 KIAA171 7 protein 4.8 

428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 4.7 

436026 AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 nfoanprotem 4.7 

40 400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cONA FU14035 fis, clone HE 4.7 

407819 R42185 Hs.274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channel calcium activated, fem 4.7 

422896 AW961489 Hs.154116 ESTs 4.7 

45 453616 NNL003462 Hs.33846 dyneh, axonemal, fight intermediate pol 4.7 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 Hs.159153 ESTs, Moderately scmflar to S65657 alpha 4.6 

454074 R63503 Hs.28419 ESTs 4.6 

405718 C4000799^Il6330365ldbpAA86508.1|(AB 4.6 

50 444649 AW207523 Hs.197628 ESTs 4.8 

429431 240313 Hs.106330 Homo sapiens done 1MAGE53371, mRNA seq 4.6 

427811 M81057 Hs.1 80884 <art)Oxypeptidase B1 (tissue) 4.6 

447342 AI199266 Hs.19322 Homo sapiens, Similar to RiKEN cDNA 2010 4.6 

430345 AK000282 Hs.239681 hypothetical protein FU20275 4.6 

55 454307 AW855717 gb^C1-CT0279^81299-013^01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 UV-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 
451340 AW936273 gb:QV0-OT002(W390200-107-g07OT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4.6 

60 457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs.21948 ESTs 4.6 

447233 AW246333 Hs.17901 Homo sapiens, clone IMAGE:3937015, mRNA. 4.6 

445537 AJ245671 Hs.12844 EGF-like-domaln, multiple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FU20086 4.5 

65 432374 W68815 Hs.301 885 Homo sapiens cDNA FU11346 Gs, done PL 4.5 

423833 AW503329 gb:UIW-BN0^e-02-(UJLr1 NlrLMGC_50 4.5 

406747 A1925153 Hs.217493 annexinA2 45 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



429270 
443903 
65 427122 
410275 
432912 



412102 
431716 
411050 
401418 
436194 
436211 
414080 
424115 
415786 
442117 
452784 
432731 
410534 
405196 
430217 
401793 
415747 
423679 
400238 
425627 
400608 
458634 
407771 
405906 
405925 
439382 
445263 
407162 
419536 
454359 
411558 
450715 
421451 
452864 
409757 
413043 
413499 
444619 
408380 
406992 
404285 
425247 
428046 
446163 
421147 
426451 
415227 
452176 
452862 
443646 
425523 
424687 
430009 
434469 
451381 
450229 
455700 
431924 
438885 
401451 
431676 



H56435 
D89053 
AW814902 
NA 

AK001074 

AK001581 

AA135257 

AA335497 

AW419198 

AW664964 

BE463857 

R31178 

AW905138 

NA 

N47863 

AA381209 
AB0O7975 
NA 

AF019612 

AV657310 
AL138272 
NA 
NA 

BE247684 

H57646 

N63855 

AA603305 

N71277 

AA102670 

A1266484 

AA291377 

AA033714 

NM001898 

BE158766 

BE144884 

BE538082 

AF123050 

S82472 

NA 

NM_005940 

AW812795 

AA026880 

AW592167 

AI908165 

AW821113 

AA024538 

AW378065 

A1085198 

AB007948 

J05070 

AA894564 

AA634806 

BE241831 

R18717 

BE068115 

AKD00850 



gb:yq98e09.rt Scares fetal liver spleen 4.5 

Hs.268012 fatty-add-CoenzymeAligase.lorig-chaln 4.5 
gb:MR1-ST0206-120400-022-f08ST0206Homo 4.5 

C14000338*:gl|7459502|plr||S74665 outer 4.5 

Homo sapiens cONA FU10212 fis, done HE 4.5 

hypothetical protein RJ10719; KJAA1794 4.4 

B aggressive lymphoma gene 4.4 

ESTs, Weakly similar to I38022 hypothefl . 4.4 

hypothetical protein FU13782 4.4 

ESTs 4.4 

hypothetical protein FU21062 4.4 

fibronectin 1 4.4 
gb:QVO-NN1071-280400-207-g07 NN1071 Homo4.4 

C2000662*:glI7512792|pir|lT12482 hypothe 4.4 

ribosomal protein S24 4.4 

C17001545gi|5360127|gb|AAD42882.1|AF155 4.4 

gb:EST94257 Activated T-ceib I Homo sap 4.4 

KIAA0506 protein 4.4 

C19000274*:gi|12741327|reqXP_008833^l 4.4 

membrane-bound transcription factor prat 4.4 

C10001899gi|7508633lpir||T25392 hypothe 4.4 

ESTs 4.3 

ESTs 4.3 

Target Exon 4.3 

Target Exon 4.3 

ESTs 4.3 

K1AA1560 protein 4.3 

zinc finger protein 4.3 

gb:np12d1U1 NCI_CGAPJ>r3 Homo sapiens 4.3 

gbza36e03.s1 Scares fetal liver spleen 4.3 

gamma^minobutyrfc add (GABA) A recepto 4.2 

ESTs, Weakly similar to WAA1324 protein 4.2 

ESTs 4.2 

hypothetical protein FU14260 4.2 

cystatinSN 4.2 

gb:IL2-HTO97-071299^244 : 02HT0397Homo 42 
gb:CMOW0182-041099-065-e11 HT0182 Homo4.2 



Hs.333435 

Hs.334828 

Hs.47783 

Hs.293965 

Hs.257924 

Hs.128899 

Hs.151258 

Hs.287820 



Hs.336901 

Hs.131454 

Hs.297007 

Hs.282898 
Hs.62713 



Hs.103070 

Hs.42586 

Hs.142634 



Hs.70725 

Hs.31570 

Hs.50831 

Hs.287629 

Hs.123114 



AI685464 

AI735283 

W60379 

AI220547 

AW057736 



BE007371 



Hs.8172 
Hs.44532 



Hs.155324 

Hs.155381 

Hs.25252 

Hs.293299 

Hs.169346 

Hs.72402 

Hs.282990 

Hs.8687 

Hs.164226 

Hs.158244 

Hs.151738 

Hs.22242 

Hs.172330 
Hs.8929 

Hs.272203 
Hs.184987 



Hs.172608 

Hs.57773 

Hs.135223 

Hs.323910 

Hs.61796 

Hs.200313 



ESTs, Moderately similar to A46010 X-On 4.2 

diubiquitin 42 

gb:beta -poNDNA polymerase beta {exon a 42 

C6001909:gi|704441|dblIBAA18909.1| P298 4.2 

matrix metalloproteinase 1 1 (MMP11; stro 42 

ESTs, Moderately similar to I38022 hypot 42 

prolactin receptor 4.2 

ESTs 42 

GATA-binding protein 3 (T-cell receptor 4.2 

ESTs 42 

Human DNA sequence from done RP1-28H20 4.2 

ESTs 42 

ESTs 4.2 

K1AA0479 protein 4.1 

matrix metalloproteinase 9 {gelatinase 8 4.1 

ESTs 4.1 

gb:ab28c02.r1 Stratagene lung (937210) H 4.1 

hypothetical protein MGC2705 4.1 

hypothetica! protein FU11362 4.1 
gb:CM1-8T0368-061299-060^g07 8T0368 Homo4.1 

Homo sapiens cDNA FU20843 fis, clone AO 4.1 

ESTs 4.1 

NlvL004496*:Homo sapiens hepatocyte nude 4.1 

gb^88f04a1NCI_CGAP_Pr28 Homo sapiens 4.1 

ESTs 4.1 

ESTs 4.1 

ESTs - 4.1 

HER2 receptor tyrosine kinase (c-erb-02, 4.1 

transcription factor AP-2 gamma (activat 4.1 

ESTs 4.1 
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403585 Target Exon 4.1 

438295 A1394151 Hs.37932 ESTs 4.1 

420380 AA640891 Hs.102406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase VIII 4.1 

5 416182 NM.004354 Hs.79069 cydfnG2 4.1 

418994 AA296520 Hs.89546 selectln E (endothelial adhesion molecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs.58589 glycogenic 4.0 

427674 NM_003528 Hs.2178 H2B histone family, member Q 4.0 

10 427131 AA448460 Hs.112017 GE36gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length insert cDN 4.0 

429353 AL1 17406 Hs.200102 ATP-btndlng cassette transporter MRPd 4.0 

421296 NIVL002666 Hs.103253 perflipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

15 424188 AW954552 Hs.142634 zinc finger protein 4.0 

455431 AW936484 gb^M0-DT0O57-29O20O-25W06 DT0057 Homo4.0 

404142 NA Target Exon 4.0 

441143 AI027604 Hs.159650 ESTs 4.0 

444540 AJ693927 Hs.265165 ESTs 4.0 

20 415579 AA165232 Hs.222069 ESTs 4.0 

452891 N75582 Hs.212875 ESTs, Weakly similar to DYH9J1UMAN CHJA 4.0 

414605 BE390440 gb^01283601F1 NIH_MGC_44 Homo sapiens c 4.0 

452281 T93500 Hs.28792 Homo sapiens cONAFLM 1041 fls, clone PU 4.0 

417801 AA417383 Hs.82582 Integrin, beta-like 1 (with EGF-IDce rep 4.0 

25 446232 A1281848 Hs.194691 retinoic acid induced 3 4.0 

447377 X77343 Hs.334334 transcription factor AP-2 alpha {activat 4.0 

437854 AL119723 gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

446140 AA356170 Hs.26750 hypothetical protein FU21 908 4.0 

452240 AI591147 Hs.61232 ESTs 4.0 

30 459574 AI741122 Hs.101810 Homo sapiens cDNA FU14232 fis, clone NT 4.0 

458673 N99626 gb:za39d1 Iji Scares fetal liver spleen 4.0 

444858 A1199738 Hs.208275 ESTs, Weakly sIm2arto ALUAJWMAN U 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF 153330 Hs.30246 solute carrier famfly 19 {thiamine trans 3.9 

35 450192 AA263143 Hs.24596 RAD51-interacflng protein 3.9 

406554 NA Target Exon 3.9 

416259 AA573006 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanine^lyoxylate aminotransferase 2-0 3.9 

451024 AA442176 gb:zw63b08.r1 Soares_totaLfetusJlb2HF8 w 3.9 

40 413930 M86153 Hs.75618 RAB11 A, member RAS oncogene famfly 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs.328142 ESTs 3.9 

452564 AA026777 gfcze93c11.r1 SoaresJetaLhearLNbHH19W 3.9 

442500 A1819068 Hs.209122 ESTs 3.9 

45 419759 Z21336 Hs.135411 actfn related protein 3.9 

. 424638 AI472106 Hs.49303 Homo sapiens cDNA FU1 1663 fis, clone HE 3.9 

439699 AF086534 Hs.187561 ESTs, Moderately similar to ALU 1JHUMAN A 3.9 

428042 AA419529 Hs.76391 myxovlrus (influenza) resistance l.homo 3.9 

452501 AB037791 Hs^9716 hypoMcal protein FU10980 3.9 

50 453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA: FU22097 fis, clone H 3.9 

443489 A1073512 Hs.133916 ESTs 3.9 

455092 BE152428 gb:CM04fT0323-151299-126^MHT0323Homo3.9 

401785 NMJX)2275*:Homo sapiens kerafln 15 (KRT1 3,9 

55 426427 M86699 Hs.1 69840 TTK protein kinase 3.9 

446009 AJ989885 Hs.231926 ESTs 3.9 

436033 H75391 Hs.255748 ESTs 3.9 

451067 BE172186 gb:MROW0559-110300^054i11 HT0559 Homo 3.8 

419348 AA236645 Hs.98274 ESTs 3.8 

60 444635 A1184268 Hs,339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 interacting, Wnesin-Bke (rabWnes 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 HsJ7B36 Homo sapiens PAC done RP5-1087M19 from 3.8 

65 433871 W02410 Hs.205555 ESTs 3.8 

445253 A1217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypothetical protein FU2241 8 3.8 
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443162 T49951 Hs.9029 DKF2P434G032 protein 3.8 

458194 AW383618 Hs.265459 ESTs. Moderately similar to ALU2JWMAN A 3.8 

422475 AL359938 Hs.117313 Meis (mouse) homolog 3 3.8 

440705 AA904244 Hs.153205 ESTs 3.8 

5 447290 A1476732 Hs.263912 ESTs 3.8 

403426 Target Exon 3.8 

427821 AA470158 Hs.98202 ESTs 3.8 

454288 BE222648 Hs.279458 ESTs, Highly similar to c38QA1.1b |H.sap 3.8 

443801 AW206942 Hs.253594 ESTs 3.8 

10 410658 AW105231 Hs.192035 ESTs 3.8 
410672 AW794600 gb:RC64JM0014-170300-O22-C05 UM0014 Homo3.8 

428579 NM.005756 Hs.184942 6 protein-coupled receptor 64 3.8 

445495 BE622641 Hs.36489 ESTs, Weakly similar to I38022 hypotheti 3.8 

447995 A1742618 Hs.181733 ESTs, Weakly similar to nitrDase homoto 3.7 

15 401747 Homo sapiens kerafln 17 (KRT17) 3.7 

420633 NM.014581 Hs.274480 odorantfinding protein 2A 3.7 

423545 AP000692 Hs.129781 chromosome 21 open reading frame 5 3.7 

* 433138 AB029496 Hs.59729 semaphorin sem2 3.7 

434715 BE005346 Hs.116410 ESTs 37 

20 428664 AK001666 Hs.1 89095 simfiar to SALL1 (sal (DrosophilaHike 3.7 

450951 AA018534 Hs.103334 ESTs 3.7 
402696 NA C3002523:gi|6686211|sp|Q27533|YH2M„CAEEL37 

446868 AV660737 Hs.135100 ESTs 37 

458154 AW816379 Hs.335018 ESTs , 37 

25 422026 U80736 Hs. 11 0826 trinucleotide repeat containing 9 37 

419440 AB020689 Hs.90419 KIAA0882 protein 37 

421524 AA312082 Hs.105445 GDNF My receptor alpha 1 37 

417283 N62840 Hs.48648 ESTs 3.7 

401508 NA NlVL024817:Homo sapiens hypothetical prot 37 

30 410303 AA324597 Hs.21851 Homo sapiens cDNA FU12900 fis, clone NT 37 

420362 U79734 Hs.97206 hunfingtin interacting protein 1 3.7 

433384 AI021992 Hs.124244 ESTs 37 

434302 AA629065 Hs.1 16301 ESTs 3.7 

443938 R55373 Hs.20864 ESTs 37 

35 448420 BE623004 gb:601441282F1 NIHJWG(L72 Homo sapiens c 37 

458712 AI347502 Hs.107872 hypothetical protein FU20761 37 

433404 T32982 Hs.102720 ESTs 37 

405232 NM_015832:Homo sapiens methyl-CpG bindln 37 

430491 AL109791 Hs.241559 Homo sapiens mRNA full length insert cDN 37 

40 455609 BE011668 gb:CM3-BN0223-100500-177-a04 BN0223 Homo37 

450164 A1239923 Hs.30098 ESTs 3.7 

453948 AI970797 Hs.64859 ESTs 37 

436061 AI248584 Hs.190745 Homo sapiens cDNA: FU21326 fis, clone C 3.7 

401049 NA Target Exon 3.6 

45 418867 031771 Hs.89404 msh(Drosophfla)homeo box homolog 2 3.6 

420179 N74530 Hs.21168 ESTs 3.6 

458663 AV658444 Hs.280776 tankyrase, TRF1-inieracting ankyrtowela 3.6 

437259 AI377755 Hs.120695 ESTs 3.6 

428309 M97815 Hs.183650 cellular retinoic add-binding protein 2 3.6 

50 450522 AI698839 gb:wd31f02.x1 SoaresJIFLJJ3BC_S1 Homo s3.6 

451952 AL120173 Hs.301663 ESTs 3.6 
412209 AW901456 gb:RCQ-NN1012-27O3QO431^7 NN1012 Homo3.6 

425201 AA352111 gb£ST60061 Activated T^cells XX Homo sa 3.6 

443830 AI142095 Hs.143273 ESTs 3.6 

55 439255 BE164500 gb^C4-HTO469-230300-O14-e10HT0469 Homo 3.6 

414869 AA157291 Hs.21479 ubtnudeb 1 3.6 

409064 AA062954 Hs.141883 ESTs 3.6 

407721 Y12735 Hs.38018 dua^ped^tyrosineKYHihosphoryi 3.6 

445135 AK000O54 Hs.12347 hypothetical protein RJ20047 3.6 

60 404091 NA TaigetExon 3.6 

409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast 3.6 

405153 Target Exon 3.6 

423248 AA380177 Hs.125845 fauiose^-ptospnate^pimerase 3.6 

403639 NA ENSP00000233023 # :CDNAFU12662frs,clon 3.6 

65 404360 C7001385;gii12082809|gb[AAG48618.1|AF315 3.6 

422352 AA766296 Hs.99200 ESTs 3.6 

423338 AB007961 Hs.127338 KIAA0492 protein 3.6 
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424202 BE350295 ■ Hs.15032 RAN binding protein 17 3.6 

431750 AA514986 Hs.283705 ESTs 3.6 

439907 AA853978 Hs.1 24577 ESTs 3.6 

453596 AA441838 Hs. 62905 hypothetical protein FU14834 3.6 

406446 NA Target Exon 3.6 

418454 AA315308 Hs.195870 hypothetical protein RJ14991 3.6 

434360 AW015415 Hs.127780 ESTs 35 

409079 W87707 Hs.82065 interleukin 6 signal transducer (gp130, 3.6 

440132 A1697121 Hs.202466 ESTs, Weakty similar to S65824 reverse t 3.6 

448706 AW291095 Hs^1814 interleukin 20 receptor, alpha 3.6 

440671 AW297920 Hs.130054 ESTs 3.5 
407647 AW860158 gb^OWT0379-290100^^CT0379Hoino35 

459023 AW968226 Hs.60798 ESTs 3.5 

402820 Nrvt017646*:Homo sapiens tRNA isopentenyl 3.5 

417009 AA191719 Hs.314714 ESTs 3.5 

447048 AW393080 Hs^28320 hypothetical protein FU23537 3.5 

449978 AI806335 Hs.200829 ESTs, Weakly similar to T30171 runein - 3.5 

428062 AA4206B3 Hs.98321 hypothetical protein FU14103 3.5 

452909 NM.015368 Hs.30985 pannexln 1 3.5 

400610 NA Target Exon 3.5 

417843 W07361 Hs.22545 Homo sapiens cDNA FU12935 fis, clone NT 35 

419335 AW960146 Hs.284137 hypothetical protein RJ12888 3.5 

451592 AI805416 Hs.213897 ESTs 3.5 

443270 NMJW4272 Hs.337737 Homer, neuronal Immediate early gene, 1B 3.5 

423948 AW392342 Hs.283077 centrosornai P4.1 -associated protein; unc 3.5 

449424 AW448937 Hs.197030 ESTs 35 
423841 AW753967 gb:RC2^T0304^010W114i12CT0304Hrirno35 

416806 NMJ0OO288 Hs.79993 peroxisomal biogenesis factor 7 3.5 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALUS JHUMAN A 3.5 

433104 AL043002 Hs.128246 ESTs, Moderately simDar to unnamed prot . 3.5 

415778 HS4847 Hs.49391 hypothetical protein LOC54149 3.5 

413054 AW316843 Hs.66309 hypothetical protein MGC11061 35 

416636 N32536 Hs.42645 solute carrier famDy 16 (rnorracarboxyfx 3.5 

424639 AI917494 Hs.9812 Homo sapiens cDNA HJ14388 fis, done HE 3.5 

424827 AI057094 Hs.96867 Homo sapiens cDNA: FU23155 fis, done L 3.5 

437782 A137Q876 Hs.79090 exportin 1 (CRM1. yeast, homotog) 3.5 

411514 AW850178 gb:tL«nT)219-271099422-H12 CT0219 Homo 3.5 

413783 AA314337 Hs501547 ribosomal protein S7 35 

421106 AAB77124 Hs.172844 ESTs 3.5 

431291 N25521 Hs.25275 KruppeMype zinc finger protein 3.5 

440623 AI935016 Hs.216639 ESTs 3.5 
455838 BE145808 gb:IWO-HT0208-101299-103-f1 1 HT0208 Homo 3.5 

458771 AW295151 Hs.163612 ESTs 35 

442942 AW167087 Hs.131562 ESTs 35 

436550 Z50158 Hs.270235 ESTs, Weakly slmtlar to MMHUB1 taminin b 3.5 

418849 AW474547 Hs.53565 Homo sapiens PIG-M mRNA formannosyltran 3.5 

424420 BE614743 Hs.146688 prostaglandin E synthase 35 

430916 AW505021 Hs.88414 BTB and CNC homology 1. basic leudnezi 35 

432030 A1908400 Hs.143789 ESTs 35 

439405 AF086224 Hs55238 ESTs 3.5 

405917 NA C17O0O675:gij729O7O3|gb]AAF46150.1|(AE0 35 

452727 AW993582 Hs.176220 ESTs 35 

426320 W47595 Hs.1 69300 irahstorrning growth factor, beta 2 3.4 

421070 AA283185 Hs.19327 ESTs 3.4 

42462 AW904466 Hs.321197 PDZ domain protein (Drosopriila InaMce 3.4 

428508 BE252383 Hs.184668 SBBI31 protein 3.4 
455651 BE064962 gb:RC1^313-13040M1&cQ2 BT0313 Homo 3.4 

410555 U92649 Hs.64311 a dislntegrin and metalloproteinase doma 3.4 

447754 AW073310 Hs.1 63533 Homo sapiens cONA FU14142 fis. clone MA 3.4 
418636 AW749855 gb:QV4-8T0534-281299453«fl5 BT0534 Homo 3.4 

404097 NA C50O)242^gll9369379lgblAAF87128.1|A(X06 3.4 

434205 AF1 19861 Hs.283032 hypothetical protein PRO201 5 3.4 

421072 AI215069 Hs.89113 ESTs 3.4 

402421 NA C1001578*:giJ6759903|gb|AAF28099.1| (AF1 3.4 

405248 NA Target Exon 3.4 

407638 AJ404672 Hs.334483 hypothetical protein FU23571 3.4 

403000 BE247275 Hs.15 1787 U5 snRNP^spedfc protein, 116 kD 3.4 
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433393 
432239 
458747 
442082 
417974 
446002 
448995 
436007 
424698 
435202 
410467 
405460 
441826 
453472 
447078 
441690 
420092 
418478 
408908 
414737 
449650 
418912 
436405 
453911 
409361 
429548 
420807 
409695 
445189 



426681 
458722 
409430 
443194 
445432 
410908 
406151 
436461 
411171 
432415 
439310 
401575 
420900 
445628 
448243 
445102 
442118 
416783 
435039 
451474 
442559 
453921 
420036 
435627 
411598 
446733 
410153 
403637 
405547 
427878 
451871 
410313 
416856 
449490 
450506 
440634 
459055 



AF038564 

X81334 

BE618395 

R41823 

AA210765 

A1346468 

AI613276 

AI247716 

M164366 

A1971313 

AF102546 

NA 

AW503603 

AL037925 

AW885727 

R81733 

AA814043 

U38945 

BE296227 

AI160386 

AF055575 

NMJJ00685 

AA1 60079 

AW503857 

NMJJ05982 

AW138872 

AA280627 

AA296961 

AI936450 

NA 

AA994896 

AA741545 

R21945 

AI954968 

AV653771 

AA121686 

NA 

AW511956 

AW820260 

T16971 

AF086120 

NA 

AL045633 

A1344166 

AW369771 

AW204610 

AA976718 

AA206186 

AW043921 

T70874 

T10213 

AI824009 

R60336 

W88774 

BE336654 

AA863360 

BE311926 

NA 

C05766 

AI821005 

R10305 

N27833 

AI652777 

NMJJ04460 

AI253123 

N23235 



Hs.98074 
Ks.2936 
Hs.257391 
Hs.7413 

Hs.145789 
H&5662 
Hs^32168 
Hs.151973 
Hs.170204 
Hs.63931 

Hs.129915 

Hs.301570 
Hs.33106 
Hs.88045 
Hs.1174 
Hs.250822 
Hs.125087 
Hs.23838 
Hs.89472 
Hs.172932 
Hs.4007 
Hs.54416 
Hs.13528d 
Hs.57846 

Hs.147482 

Hs.22514 
Hs.282832 
Hs.166975 
Hs.279009 

Hs,10592 

Hs.293261 

Hs^89014 
Hs.102793 

Hs.44269 
Hs.155743 
Hs.52620 
Hs^2270 
Hs.202242 
Hs.79889 
Hs.130526 
Hs.207636 
Hs.159993 
Hs.44577 
Hs.52792 
Hs.118370 
Hs.70937 
Hs.26040 
Hs.15830 



Hs.181022 

Hs.1 18599 

Hs.185683 

Hs^69028 

Hs.197069 

Hs.418 

Hs.127356 

Hs.30567 



Itchy (mouse homolog) E3 ublqultin prate 3.4 

matrix metailoproteinase 1 3 (coifagenase 3.4 

hypothetical protein OKFZp761 J1523 3.4 

ESTs; catsyntenin-2 3.4 
gb2i90c06.r1 NCLCGAPJ3CB1 Homo sapiens 3.4 

ESTs 3.4 

guanine nucleotide binding protein (G pr 3.4 

ESTs 3.4 

hypothetical protein FU2351 1 3.4 

KIAA0551 protein 3.4 

dachshund (DrosophOa) homolog 3.3 

Target Exon 3.3 

phosphotriesterase related 3.3 

gb:DKFZp564M037_r1 564 (synonym: hfcr2) 3.3 

ESTs 3.3 

ESTs 3.3 

ESTs 3.3 

cydin-dependent kinase inhibitor 2A (me 3.3 

serine/threonine kinase 15 3.3 

ESTs 3.3 

calcium channel, voltage-dependent L ty 3.3 

angiotensin receptor 1 3.3 

Homo sapiens mRNA for partial 3'UTR, seq 3.3 

SarcolemmakissoclatBd protein 3.3 

sine oculis homeobox (Drosophila) homolo 3.3 

ESTs 3.3 

ESTs 3.3 

gb:EST1 12514 Adrenal gland tumor Homo sa 3.3 

ESTs 3.3 

Target Exon 3.3 

ESTs 3.3 

ESTs, Weakly similar to T24961 hypotheti 33 

splicing factor, argWne/serine-rich 5 3.3 

matrix Gia protein 3.3 
gb:AV653771 GLC Homo sapiens cDNA clone 3.3 

ESTs 3.3 

Target Exon 3.3 

ESTs 3.3 
gb:QV2-STO296-15020(WMO-c1O ST0296 Homo 3.3 

ESTs. Weakly similar to A43932 mucin 2 p 3.3 

ESTs 3.3 

Target Exon 3.3 

ESTs 3.3 

ESTs 3.3 

integrin, beta 8 3.3 

ESTs 3.3 

ESTs 3.3 

monocyte to macrophage difierentiation-a 3.3 

ESTs 3.3 

ESTs 3.2 

gyoosyttransferase 3.2 

ESTs 3.2 
HorrwsapjeramRNA;cDNADKF2p586l1823(f 3.2 

ESTs 3.2 

H3 histone family, member A 3.2 

ESTs, WeaHy similar to fatty acid omega 3.2 

hypothetical protein FLJ 12691 3.2 

C3001108^i|10047201|dbJ|BAB13394.1| (A 3.2 

NMJ)18833*:Homo sapiens transporter 2, A 3.2 

CG1-07 protein 3.2 

ESTs 3.2 

ESTs 3.2 

ESTs, Weakly similar to I38022 hypotheti 3.2 

ESTs 3.2 

fibroblast activation protein, alpha 3.2 

ESTs, HigWy similar to S21424 nestin (H 3.2 

ESTs. Weakly simiiarto B34087 hypoM 3.2 
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452190 
430965 
405394 
424693 
454265 
437687 
428372 
414083 
411670 
416283 
437488 
428398 
452042 
421477 
438078 
448816 
419519 
404580 
447046 
457473 
429838 
459702 
400195 
417860 
417995 
422589 
435870 
440801 
426274 
423728 
439677 
452834 
431349 
417576 
430264 
416827 
410835 
426269 
405336 
437783 
440931 
455945 
430437 
405848 
455685 
406970 
409602 
423518 
425653 
426326 
433805 
437152 
448602 
452844 
407366 
408254 
424085 
416790 
420020 
426119 
426968 
457421 
453403 
454141 
426650 
450865 
407993 



H26735 
AA489732 

BE169810 

H03556 

AA765917 

AKD00884 

AL121282 

AW856552 

NIWL0O5429 

AA758239 

AI249368 

H38857 

Ai904743 

AI016377 

AB033052 

AI198719 

AA326167 

AW974903 

AW904907 

AI204995 

NA 

AW408557 

AW974175 

AA312735 

AA701327 

AA906366 

D38122 

AW891294 

R82331 

AJS33627 

AA503653 

AA339449 

AA470519 

8E327311 

AW806906 

H15302 

NA 

AI683150 
AI583052 
BE160636 
AI768801 
NA 

8E066976 

M29994 

W26713 

045027 

AI065104 

BE165753 

AA706910 

AL050027 

AJ541305 

AW407181 

AF026942 

AW807227 

NMJKJ2914 

R83066 

BE295866 

W94997 

U07616 

AL117431 

BE466639 

AW138413 

AA382814 

AI248013 

AW135274 



Ks.91668 
Hs.154918 

Hs.47557 
Hs.300949 
Hs.122840 
Hs.183887 
Hs.257786 

Hs79141 
Hs.180330 
Hs.98558 
Hs.243901 
Hs.104650 
Hs.131693 
Hs.22151 
Hs.176376 

Hs.17170 
Hsi91231 
Hs.30732 



Hs.235498 
Hs.188751 
Hs.30512 
Hs.17949 
Hs.190535 
Hs.2007 
Hs.132136 
Hs.164599 
Hs.105685 
Hs.156942 
Hs.82285 

Hs.47166 

Hs.168950 

Hs.201550 
Hs.270058 

Hs.169943 



Hs.256972 
Hs.129732 
Hs.249718 
Hs.250528 
Hs.112742 

Hs.48778 
Hs.218377 



Hs.139226 
Hs.7043 
Hs.94382 
Hs.189917 
Hs.173034 
Hs.1 12165 
Hs,61779 
Hs.139336 

Hs.106532 
Hs,12433 



Homo sapiens done PP1498 unknown mRNA 3.2 

ESTs 3.2 

Target Exon 3.2 

ESTs 3.2 

ESTs, Weakly similar to thyroid hormone 3.2 

ESTs 3.2 

hypothetical protein FU22104 3.2 

ESTs 3.2 
gb:RC1-CT0294-08010lH}12-a04 CT0294 Homo3.2 

vascular endothelial growth factor C 3.2 

ESTs 3.2 

ESTs 3.2 

Homo sapiens cDNA FU20738 (is, done HE 3.2 

hypothetical protein FU 10292 3.2 

ESTs 3.2 

K1AA1226 protein 3.2 

ESTs 3.2 

NM_01411?:Homo sapiens WctwrhJrwphala 32 

G protein-coupled receptor 4 3.2 

ESTs 3.1 

hypothetical protein RJ13409; WAA1711 3.1 

gb:an03c03jc1 Stratagene schizo brain S1 3.1 

NM_007057*:Homo sapiens ZW10 interactor 3.1 

hypothetical protein FU14075 3.1 

ESTs, Weakly similar to MAPBJWMAN MICRO 3.1 

Homo sapiens mRNA for WAA0556 protein, 3.1 

ESTs 3.1 

ESTs 3.1 

tumor necrosis factor (Bgand) superfaml 3.1 

solute carrier family 4, sodium btearbon 3.1 

ESTs 3.1 

WAA1688 protein 3.1 

ESTs, Moderately similar to ALU2_HUMAN A 3.1 

phosphortoosyiglydnamkis formyitransfer ■ 3.1 

gb:nc71f10.s1NCLCGAP.Pr1Homosapiens 3.1 

HT021 3.1 

gb:QV4-ST0023-160400-172Hd12 ST0023 Homo 3.1 
Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1 

Target Exon 3.1 

ESTs, Weakly similar to ALU1JWMAN ALUS 3.1 

ESTs 3.1 
gb:PM1-HT0422-291299-002-c08 HT0422 Homo 3.1 

Homo sapiens cQNA FU13569 fis, done PL 3.1 

Target Exon 3.1 

gb:PM0-8T0340-21 1299-003^12 B 70340 Homo 3.1 

gbrHuman alpha-l spectrin gene, exon 12. 3.1 

ESTs 3.1 

R3H domain (binds single-stranded nuclei 3.1 

ESTs, Weakly similar to A46010 X-linked 3.1 

Homo sapiens, done 1MAGE:4098694, mRNA. 3.1 

ESTs 3.1 
gb:Homo sapiens mRNA; cONA DKFZp566C03243.1 

niban protein 3.1 

Homo sapiens cONA FU1 1927 fis, done HE 3.1 

gb:Homo sapiens dg33 mRNA, partial sequ 3.1 
gb:MR4^0l)62-180200^01-e10ST0062Homo3.1 

replication factor C (activator 1) 2 (40 3.1 

sucdnate-CoA tigase, GDP^ormlng, alpha 3.1 

adenosine kinase 3.1 

ESTs 3.1 

amphiphysin (StirTWann syndrome with br 3.1 

Homo sapiens cONA RJ12198 fis, done MA 3.1 

Homo sapiens cONA FU13591 fis. done PL 3.1 

ATP^irKTing<^etto,sub-eamiVC(a r m 3.1 

gb£ST96097 Testis I Homo sapiens cONA 5 3.1 

ESTs, Weakly similar to 138588 reverse t 3.1 

ESTs 3.1 



292 




WO 02/059377 PCT/US02/02242 



446466 H38026 Hs.308 arrestin 3, retinal (X-arrestin) 3.1 

457888 BE219794 Hs.293471 ESTs 3.1 

420058 AK001423 Hs.94694 Homo sapiens cDNA FU10561 lis, clone NT 3.0 

409248 AB033035 Hs.51965 KIAA1209 protein 3.0 

5 452747 BE153855 Hs.61460 Ig superfamDy receptor LNIR 3.0 

418926 AA232658 Hs.105794 UDP^IuaKaglycoprotBinglucosyliransle 3.0 

419346 A1830417 Hs.44143 polybromol 3.0 

4298M N93266 Hs.40747 ESTs 3.0 

435147 AL133731 Hs.4774 Homo sapiens rnRNA; cDNA DKFZp761C1712 (f 3.0 

10 420139 NM.005357 Hs.95351 lipase, hormone-sensitive 3.0 

405609 NA ENSPO00OO241065*:CDNA 3.0 

404274 NIWL002944*:Homo sapiens v-ros avian UR2 3.0 

449777 AI971362 Hs.231945 ESTs 3.0 

415459 H07118 Hs.6099 ESTs 3.0 

15 415245 N59650 Hs.27252 ESTs 3.0 

406291 NA Target Exon 3.0 

414210 BE383592 gb:601297871F1 NIH_MGC_19 Homo sapiens c 3.0 

432055 AW972359 Hs.293334 ESTs 3.0 

442246 AI7919B8 Hs.129115 ESTs 3.0 

20 451353 N21043 Hs.42932 ESTs 3.0 

451177 AI969716 Hs.13034 ESTs 3.0 

418026 BE379727 Hs.83213 fatty add binding protein 4, adipocyte 3.0 

401326 NA C10000447*:gij1168375|sp|P43467|AGA1J 3 ED 3.0 

409920 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.0 

25 432887 AI926047 Hs.162859 ESTs 3.0 

411789 AF245505 Hs.72157 DKFZP564I1922 protein 3.0 

401045 C1 1001883*:gil6753278|reflNP J33938.1] c 3.0 

432162 AA584062 Hs.272798 hypothetical protein RJ20413 3.0 

434627 AI221894 Hs.39311 ESTs 3.0 

30 442611 BE077155 Hs.1 77537 hypothetical protein DKFZp761B1 514 3.0 

425477 AW958879 Hs.270535 ESTs 3.0 

457183 H91882 Hs.1 18569 Dvt-binding protein IDAX (inhibition of 3.0 

433014 NMJJ14711 Hs.279912 KIAA0419 gene product 3.0 

415542 R13474 Hs.290263 ESTs, Weakly similar to I38022 hypotheti 3.0 

35 416173 R52782 gb:yg99d09.r1 Soares Infant brain 1NIB H 3.0 

408155 AB014528 Hs.43133 WAA0628 gene product 3.0 

453913 AW004683 Hs.78934 mutS (E. coB) homolog 2 (colon cancer, 3.0 

435495 AI754212 Hs.21951 Homo sapiens Xq pseudoautosoma) region; 3.0 

423629 AW021173 Hs.18612 Homo sapiens cDNA: FU21909 fis, done H 3.0 

40 411836 AW901879 Hs.314453 ESTs 3.0 

415030 D31118 Hs.191735 hypothetical protein MGC1 0520 3.0 

419606 AW294795 Hs.198529 ESTs 3.0 

440310 AA878939 Hs.125406 ESTs 3.0 

443608 AI375957 Hs.289074 F-boxoniy protein 22 3.0 

45 414140 AA281279 Hs.23317 hypothetical protein FU14681 3.0 

444781 NJVL014400 Hs.1 1950 GPUnchored metastasis-associated prate 3.0 

435393 AA701259 Hs.189299 ESTs 3.0 

454071 AI041793 Hs.42502 ESTs 3.0 

446922 BE175605 gkRC54ff0580-100500-022-H07 HT058O Homo3.0 

50 448062 AW295923 Hs.255472 KIAA1843 protein 3.0 

427711 M31659 Hs. 180408 solute carrier family 25 (mitochondrial 3.0 

450382 AA397658 Hs.60257 Homo sapiens cDNA FU13598 fis, done PL 3.0 

424866 W01938 Hs.337243 ESTs, Weakly similar to ALU7_HUMAN ALU S 2.9 

433043 W57554 Hs.125019 lymphoid nudear protein (LAF-4) mRNA 2.9 

55 420802 U22376 Hs.1334 vnnyb avian myeloblastosis viral oncogen 2.9 

445625 BE246743 Hs.288529 hypothetical protein RJ22635 2.9 

403677 NA C4001462:gi|4887715|gb|AAA79329.2|(L088 2.9 

411093 BE067650 gb:MR4-BTO358-O90300-003-e01 BT0358 Homo 2.9 

435255 W87434 Hs.106015 ESTs.ModerateiysirnilartoALUI^HUMANA 2.9 

60 443127 BE568102 Hs.180312 mitochondrial ribosomai protein S16 2.9 

448104 AI674818 Hs.316433 Homo sapiens cDNA FU1 1375 fis, done HE 2.9 

427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9 

430414 AW365665 Hs.120388 ESTs 2.9 

423600 AI633559 Hs.310359 ESTs 2.9 

65 458562 N34128 Hs.145268 ESTs 2.9 

402109 NA Target Exon • 2.9 

429629 BE501732 Hs.30622 Homo sapiens cDNA FU 13010 fis, done NT Z9 
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442295 
419752 
404721 
445107 
401987 
430566 
444517 
445563 
427691 
456561 
401458 
421039 
459504 
424962 
409617 
416931 
413221 
409732 
433687 
434340 
454529 
421379 
452291 
457402 
449051 
408761 
401093 
435061 
447985 
449340 
426384 
411905 
405953 
420854 
434265 
428365 
442861 
448337 
452554 
412248 
449450 
434757 
409038 
454545 
439842 
428479 
424800 
411086 
400250 
449168 
456482 
426044 
431854 
405873 
440400 
458265 
413708 
423739 
424408 
453096 
421825 
417742 
402765 
444378 
419172 
401497 
402376 



A1208121 Hs.147313 



AI827248 t&224398 Homo sapiens cDNA FU1 1469 fis, done HE 2.9 

AA249573 Hs.152618 ESTs, Moderately similar to ZN91_HUMANZ 2.9 

NM_005596*:Homo sapiens nudear factor 1 2.9 

ESTs, Weakly similar to 138022 hypothefi 2.9 

NM_002737*:Homo sapiens protein kinase C 2.9 

Hs.190149 ESTs 2.9 

Hs.146883 ESTs 2.9 

Hs.149006 ESTs 2.9 

Hs.20726 ESTs 2.9 

Hs.246358 ESTs. WeaWy similar to T32250 hypothec 2.9 

Target Exon 2.9 

Hs.101299 culiinS 2.9 
gb£01315974F1 NIH_MGC_8 Homo sapiens cO 2.9 

Hs.153954 TRAM4ike protein 2.9 
Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 2.9 

Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PM0W0425-141299-001-F08 HT0425 Homo2.9 

Hs.56148 NY-REN-58 antigen 2.9 

gb*Jiy57g01.s1 NCLCGAP_Pr18 Homo sapiens 2.9 

Hs.128685 ESTs,WeaWysirnIlartoT17226hyp(jflie1i 2.9 

Hs^70425 ESTs 2.9 

Hs.103982 small inducible cytokine subfamily 8 (Cy 2.9 

Hs.28853 CDC7 (cell division cycle 7, S. cerevisi 2.9 

H&149342 activation-Induced cytidine deaminase 2.9 

Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, Z9 

Hs.238936 ESTs, Weakly similar to (deffine not ava 2.9 

C120M586*:gI|6330167|dbJ|BAA66477.1| (A 2.9 

Hs.163944 ESTs 2.9 

Hs.200949 ESTs 2.9 

Hs.195359 hypothetical protein MGC10954 2.9 

Hs.303662 ESTs 2.9 



AA481282 
AI939339 
AW873606 
AW194426 
AI868634 

NMJ03478 
BE514127 
NH.012288 
BE003760 
045371 
BE161151 
NMJD16122 
AA743991 
AI193043 
Z45439 
Y15221 
AF015592 
AW452648 
AW961400 
AA057264 

AI651474 
AI681475 
AW235786 
AW72078 
BE265067 
NA 

AW296927 
AA846811 
AA295331 
AA243837 
AW206453 
AW452434 
BE176480 
AL039852 



T97490 

AW806899 

AI910896 

Y00272 

AL035588 

BE070800 

NA 

NM_016206 

AA485224 

AA502490 

AA383550 

NA 

AA994364 

AI075375 

BE158791 

AA398155 

AI754813 

AW294631' 

AA298758 

R64719 



Hs.130554 
Hs.183861 
HS.577B7 
Hs.3782 
Hs.58006 

Hs.49136 
Hs.132921 
Hs.50002 

Hs.132413 
Hs.184572 
Hs.153203 



AW338625 



Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs. 146428 
Hs.11325 
Hs.183747 



Hs.12569 
Hs.22120 



gb:601193893F1 NIH_MGCJT Homo sapiens cO 2.9 
Target Exon 2.8 
gb:UI-H-BWf>ajcH>074WJI^1 NCLCGAP^Su 2.8 
Homo sapiens cONA:FU23089 fis, done L 2.8 
Homo sapiens cDNA FU20042 fis, clone CO 2.8 
ESTs 2.8 
ESTs 2.8 
ESTs, Weakly similar to ALU5_HUMAN ALUS 2.8 
gfcRC34fT0585-16030(W22-<J02 HT0585 Homo 2.8 
ESTs, Moderately similar to ALU7 JiUMAN A Z8 
ESTs 2.8 
small Inducible cytokine subfamily A (Cy 2.8 
gb:QV4-ST0023-160400-172-c12 ST0023 Homo 2.8 
ESTs 2.8 
cell division cycle 2, G1 to S and G2 to 2.8 
MyoO family inhibitor 2.8 
gbKC3^T0502^51199-011^07 BT0502 Homo JL8 
Eos Control 2.8 
colon carcinoma related protein 2.8 
gfcaa41b12.s1 NC|_CGAP_GCB1 Homo sapient 



ESTs 2.8 

polymerase (ONA directed) Iota 2.8 

Target Exon 2.8 

ESTs, Weakly similar to T25472 hypoteli 2.8 

ESTs, Weakly similar to IRXSLHUMAN IROQU 18 

gb:lL2-HT0397-091 299-025^)02 HT0397 Homo 2.8 

ESTs 2.6 

collagen, type V, alpha 1 2.8 

ESTs Z8 

ESTs, Moderately similar to CALB _HUMAN C 18 
gb:EST22d11 WATM1 Homo sapiens cONA don 2.8 

C1003621':gi|12407405IgbJAAG53491.1|AF22 2& 

ESTs Z8 

ESTs 2.8 

Target Exon 2.8 

C19000763^i|1363912(pirl|JC4296 ring f 2.8 
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405041 NA C3001706*gl|1345652|sppi5989|CA36_CHlC 2.8 



408758 NMJ)03686 Hs.47504 exonucleasel 2.8 

431917 D16181 Hs.2868 peripheral myelin protein 2 2.8 

437583 AA761190 Hs.244627 ESTs 2.8 

5 453737 AA744862 Hs.194293 ESTs, Weakly simHar to I54374 gene NF2 2.8 

458094 AF086325 gb:Homo sapiens full length Insert cONA 2.8 

401283 NA Target Exon 2.8 

410784 AW803201 gb:IL2-UJVf0077-070500^E06UM0077 Homo 2.8 

417601 NM.014735 Hs.82292 WAA0215 gene product 2.8 

10 418236 AW994005 Hs.337534 ESTs 2.8 
435532 AW291488 Hs. 117305 Homo sapiens, done IMAGE3682908, mRNA 28 
454714 AW815098 gb:QV4^T0212-091199-023^10ST0212Homo 2.B 

418629 8E247550 Hs.86859 growth factor receptor-bound protein 7 2.8 

442101 AI651930 Hs.135684 ESTs 2.8 

15 405080 AK000375 Hs.m20 HOCMC28P protein 2.8 

414661 T97401 Hs.21929 ESTs 2.8 

425589 AI650633 Hs.143688 Homo sapiens cDNA: FU23031 fis, clone L 2.8 

429630 AI916662 Hs.21 1577 klnectln 1 (kinesin receptor) 2.7 

428824 W23624 Hs.173059 ESTs 2.7 

20 421566 NMJJ00399 Hs.1395 early growth response 2 (Krox-20 (Drosop 2.7 
414596 BE386870 gb.*601275271F1 NIHJUGC JO Homo sapiens c 2.7 

440868 R79707 Hs.263339 ESTs, Moderately similar to I38022 hypot 2.7 

452943 BE247449 Hs.31082 hypothetical protein FU10525 2.7 

443772 AV646449 Hs.282872 ESTs 2.7 

25 432361 AI378562 Hs.159585 ESTs 2.7 

430375 AW371048 Hs.93758 Whistor* family, member H 27 

406504 NA C5000558:gl|4504675|refINPJ)02175.1|mt 2.7 

423279 AW959861 Hs.290943 ESTs 2.7 

424871 NMJ504525 Hs.153595 low density lipoprotetrwelated protein 2.7 

30 453619 H87648 Hs.33922 Homo sapiens, clone MGCD084, mRNA, comp 2.7 

423961 D13666 Hs.136348 osteoblast spedfic factor 2 {fascWin 2.7 

422156 N34524 gb:yy56d10.s1 Soaresjnulfiple^sclerosis. 2.7 

415752 BE314524 Hs.78776 putafive transmembrane protein 2.7 

419987 NMJH35Q14 Hs.94070 osteomodulin 2.7 

35 406182 NA Target Exon 2.7 

416495 X69970 Hs.79350 RYK receptor-tfte tyrosine kinase 2.7 

444701 AI916512 Hs.198394 ESTs ■ 2.7 

408171 AA301228 Hs.43299 hypothetical protein FU12890 2.7 

430153 AW968128 Hs.336679 ESTs 2.7 

40 413383 AA128978 Hs.154706 hypothetical protein FU1491 7 2.7 

414831 M31158 Hs.77439 protein kinase. cAMP-dependent regulato 2.7 

413278 BE563085 Hs.833 interfeion-stimulated protein, 15 kOa 2.7 

433132 AB026264 Hs.284245 hypothetical protein IMPACT 2.7 

437030 AA742577 Hs,303781 EST 2.7 

45 439031 AF075079 gbiHomo sapiens full length insert cDNA 2.7 

449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cd 2.7 

406153 Target Exon 2.7 

406625 Y13647 Hs.1 19597 stearoyW>oA desaturase (delta-9-<lesatur 2.7 

444698 AI188139 Hs.147050 ESTs 2.7 

50 432328 AI572739 Hs.195471 6-r^pr©fructo-2^ase/fructose-2 ( 6^ 2.7 

429628 H09604 Hs.13268 ESTs 2.7 

420149 AA255920 Hs.88095 ESTs 2.7 

431207 AA495925 Hs.9394 ESTs 2.7 

438394 BE379623 Hs.27693 peptidylprolyl Isomerase (cydophlGnH 2.7 

55 443304 AI050073 Hs.135338 ESTs 2.7 

427660 AI741320 Hs.114121 Homo sapiens cDNA: FLJ23228 fts, clone C 2.7 

408460 AA054726 Hs.285574 ESTs 2.7 

416515 N91716 Hs.194140 ESTs, WeaWy similar to J38022 hypotheti 2.7 

429922 297630 Hs.2261 17 H1 histone family, member 0 2.7 

60 418203 X54942 Hs.83758 COC28 protein kinase 2 2.7 

439509 AF086332 Hs.58314 ESTs 2.7 
402184 NA ENSP00000245238*:CDNAFU10922fis.cton 2.7 

450496 AW449251 Hs.257131 ESTs 2.7 

451963 AI825440 Hs.224952 ESTs 2.7 

65 457938 AI373638 Hs.133900 ESTs 17 

441541 AA938663 Hs.199828 ESTs 2.7 

441111 AI806867 Hs.126594 ESTs 2.7 
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423020 AA383092 Hs.1608 repfication protein A3 (14kD) 2.7 

445354 AV653485 Hs.6390 Homo sapiens dona FLB3344 PRO0845 mRNA, 2.7 

427961 AW293165 Hs.143134 ESTs 2.7 

410889 X91662 Hs.66744 twist (Drosophfla) homotog (acrocephatos 2.7 

5 445234 AW137636 Hs.146059 ESTs 2.7 

413903 AA496493 Hs.23136 ESTs 2.7 

406069 NA Target Exon 2.7 

447410 A1470235 Hs.172698 EST 2.7 

401256 NA NM_024Q89*:Homo sapiens hypothetical pro 2.7 

10 415139 AW975942 Hs.48524 ESTs 2.7 . 

420218 AW958037 Hs.286 ribosomal protein U 2.7 
455511 BE144762 gb:CMO-HT0180^1099465-b04 HT0180 Homo2.7 

438825 BE327427 Hs.79953 ESTs 2.6 

409564 AA045857 Hs.54943 fracture caOus 1 (rat) homolog 2.6 

15 452837 AL121053 Hs.5534 Homo sapiens cONA FU12961 lis. done NT 16 

434876 AF160477 Hs.61460 lg superfamlly receptor LNIR 2.6 

421565 AK001122 Hs.105859 hypothetical protein FU 10260 2.6 

453279 AW893940 Hs.59698 ESTs 2.6 

430785 230201 gb:HHEA22G Atrium cONA library Human hea 2.6 

20 456986 D38299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2.6 

433068 NM.006456 Hs.288215 sialyltransferase 2.6 

421952 AA30O9O0 Hs.98849 ESTs, Moderately similar to AF161 511 1 H 2.6 

429208 AA447990 Hs.190478 ESTs 2.6 

430733 AW975920 Hs.283361 ESTs 2.6 

25 441720 AI346487 Hs.28739 ESTs 2.6 

418986 A1123555 Hs.81796 ESTs 2.6 

432481 AW451645 Hs.151504 Homo sapiens cONA FU11973 fis, done HE 2.6 
434338 AW754311 gb:CMUrr0337-1412994)68-fl)7 CT0337 Homo 2.6 

417061 AI675944 Hs.188691 Homo sapiens cONA FU12033 fis, done HE 2.6 

30 410530 M25809 Hs.64173 ATPase, H transporting, lysosomal (vacuo 2.6 

456672 AK002016 Hs.1 14727 Homo sapiens, done MGC:16327, mRNA, com 2.6 

425071 NMJ)13989 Hs.154424 delodinase, todothyronine, type II 2,6 

408868 AW292286 Hs.255058 ESTs 2.6 

451531 AA018311 Hs.1 14762 ESTs 2.6 

35 405822 Target Exon 2.6 

418301 AW976201 Hs.53913 hypothetical protein FU10252 2.6 

417315 A1080042 Hs.336901 ribosomal protein S24 2.6 

434699 AA643687 Hs.149425 Homo sapiens cDNA FU1 1980 (Is. clone HE 2.6 

443204 AW205878 Hs.29643 Homo sapiens cONA FU13103 fis, clone NT 2.6 

40 405638 Target Exon 2.6 

452542 AW812256 gb:RC0*ST0174-191099^)31-aO7 ST0174 Homo 2.6 

403943 05000355:0114503225116^000765.110)1 2.6 

404535 Z25884 Hs.121483 chloride channel 1 , skeletal musde (Tn 2.6 

402800 NA Target Exon 2.6 

45 449144 AI989503 Hs.233405 ESTs 2.6 

454934 AW846080 Hs.314324 ESTs 2.6 

424717 H03754 Hs.1 52213 wingless-type MMTV Integraiion site (ami 2.6 

428303 AW974476 Hs.183601 regulator of G-proteln signaffing 16 2.6 

427970 AA418187 Hs.330515 ESTs 2.6 

50 450638 AK001826 Hs.25245 hypothetical protein FU1 1269 16 

453034 BE246010 Hs.271468 Homo sapiens mRNA for FU00038 protein, 2.6 
455097 AW855802 gb^C1-a0279-170200^023«d08CTT)279 Homo 2.6 

427317 AB028955 Hs.175780 K1AA1032 protein 2.6 

408875 NM15434 Hs.48604 OKFZP434B1 68 protein 2.6 

5 5 427510 Z47542 Hs.179312 small nuclear RNA activating complex, po 2.6 

423201 NM_000163 Hs.125180 growth hormone receptor 2.6 

406271 Target Exon 2.6 

442696 BE566962 Hs.7063 Homo sapiens cONA: FU20913 fis, done A 2.6 

454018 AW016892 Hs.100855 ESTs 2.6 

60 435420 AI928513 Hs.59203 ESTs 2.6 

434398 AA121098 Hs.3838 serunvlndudble kinase 2.6 
455708 BE069326 gb:Qv3*8T0381-170100-06ag03 BT0381 Homo 2.6 

439347 W24320 Hs.102941 Homo sapiens cDNA:FU21531 fis. done C 2.6 

407523 X64984 gb'Hsaptons mRNA HTPCRX10 for olfactory 2.6 

65 425101 AA830431 Hs.180811 ESTs 2.6 

435153 AA668763 Hs.291939 ESTs 2.6 

409139 AI681917 Hs.3321 ESTs, Highly similar to IRX1JJUMAN IROQU 16 
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455100 
414612 
440283 
423025 
431473 
404440 
403388 
403775 
405037 
407447 
420952 
435447 
440202 
445854 
421247 
414870 
457411 
424676 
404443 
452268 
430832 
444779 
408633 
459089 
447197 
454111 
411165 
406922 
456045 
413111 
423123 
432201 
427032 
445417 
422225 
428330 
410011 
426310 
419589 
437770 
442818 
414251 
428301 
431956 
455732 
458624 
428257 
418054 
458652 
458012 
422996 
410804 
457211 
440029 
448141 
409163 
431385 
414569 
439963 
455935 
425025 
416589 
404826 
422938 
421991 
417404 
448516 



BE160198 
BE274552 
AI732892 
AA831267 
AA825686 

NA 
NA 
NA 

AF290544 

AA282Q67 

AJ872932 

AW516211 

AI702885 

BE391727 

N72264 

AW085961 

Y08565 

NMJJ03512 

AI073913' 

AI192105 

AW963372 

F13036 

R36075 

AW081681 

NH.000169 

S70284 

H62943 

BE065837 

IWL012247 

AI538613 

AF012023 

AK001058 

BE245652 

L22524 

AB020641 

NM.000909 

AW973708 

AA767881 

AK001741 

AL042306 

AW628666 

AK002032 



AI362790 

BE394723 

NMJW2318 

AW375610 

A1424899 

BE091089 

U64820 

AW972565 

AW089705 

AI471598 

AA065081 

BE178536 

AF109298 

AW247529 

BE158687 

AW953168 

AA652687 

NMJQ1809 
NMJJ14918 
NAOO7350 
AW898595 



gb:QV1-HT0413-01020(M)59-h03HT0413Homo2.6 

Hs.76578 protein Inhibitor of activated STAT3 2.6 

Hs.190489 ESTs 2.6 

Hs. 12244 hypothetical protein FLJ20097 2.6 

Hs.321176 ESTs, Weakly similar to S65824 reverse t 2.6 

NM_021048:Homo sapiens melanoma antigen, 2.6 

C3001398*:gij12248917|dbjlBAB20375.1| (A 2.6 

Target Exon 2.6 

NMJ)21628*:Homo sapiens arachidonate Up 2.6 

gb.Homo sapiens aminopeptidase mRNA, par 2.6 

Hs.88972 ESTs, Moderately similar to A46010 X-fin 2.6 

gb.-wm72e03jdNCLCGAP.Ut2 Homo sapiens 2.6 

Hs.125300 ring finger protein 21, interfejoiwespo 2.6 

Hs. 145568 ESTs 2.6 

Hs. 102910 general transcription factor IIH, polype 2.6 

Hs.300670 WAA12G4 protein 2.6 

Hs.130093 ESTs 2.6 

Hs.151678 UDP4^-acetyUlpha<>^aiactc«amIne:poiyp 2.6 

C8001428*:gi|6572242Iemb|CAB62951.1| (Z9 2.6 

Hs.28777 H2A histone family, member L 2.6 

Hs.100686 ESTs, Weakly similar to JE0350 Anterior 2.6 

Hs.147170 ESTs 2.6 

Hs.46677 PRO2000 protein 2.6 
Hs.27373 Homo sapiens mRNA; cONA DKFZp56401763 (f 2.6 

gb.-yh88b01.s1 Scares placenta Nb2HP Homo 2.6 

Hs.269064 ESTs,WeaWysimflartoT42689hypotheti 2.6 

Hs.69089 galactosidase, alpha 2.6 

2.6 
2.6 



gbstearoyf-CoA desaturase (human, adfro 
Hs.154188 ESTs 

gb:RC2-BT0318-110100-012-g12 BT0318 Homo 2.6 
Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human seten2.6 

Hs.298241 Transmembrane protease, serine 3 2.5 

Hs.173274 integrin cytoplasmic domain-associated p 2.5 

Hs.12680 Homo sapiens cONA FU10196 fis, clone HE 2.5 

Hs.1 18281 zinc finger protein 266 2.5 

Hs.2256 matrix metalfoproteinase 7 (MMP7; uterin 2.5 

Hs.57856 PFTAiRE prated kinase 1 2.5 

Hs.169266 neuropeptide Y receptor Y1 2.5 

Hs^)1925 Homo sapiens cONA FU13446 fis, done PL 2.5 

Hs.122897 ESTs 2.5 

Hs.8739 hypothetical protein FLI10879 2.5 

Hs.97689 VASA protein 2.5 

Hs.98440 ESTs, Weakly similar to 138022 hypotheti 2.5 

Hs.272245 Homo sapiens cONA RJ11170 fis, done PL 2.5 

gb:QV1^TO^t-280200-084-h078T0631 Homo 2.5 

Hs.278639 KIAA1684 protein; Ekelyhomolog of mous 2.5 

Hs.275243 S100 calcium-binding protein A6 (calcycl 2.5 

Hs.83354 lysyfoxidase-Gke2 2.5 

Hs.117102 hypothetical protein FLJ13046 similar to 2.5 

Hs.188211 ESTs 2.5 

gb:PM4^T0724-1304W)Wc07 BT0724 Homo 2.5 

Hs.66521 Machadc-Joseph disease (spinocerebellar 25 

Hs.32399 ESTs, WeaWy similar to S51797 vasodilat 2.5 

Hs.293711 ESTs, Weakly similar to S64329 probable 2.5 

Hs.197531 ESTs 2.5 

gb:zm1 3a03.s1 Stratagene pancreas (93720 2.5 

Hs.1 1090 membrane-spanning 4<iomains f subfamily A 2.5 

Hs.1 18258 prostate cancer associated protein 1 2.5 

Hs.6793 platelet-activating factor acetylhydrola 2.5 
gb:CMO-HT0395-280100-169-b09 HT0395 Homo2.5 

Hs.12407 ESTs 2.5 

Hs.96151 Human DNA sequence from clone RP5-1103G7 2.5 

Target Exon 2.5 

Hs.1594 centromere protein A (17kD) 2.5 

Hs.110488 KIAA0990 protein 2.5 

Hs.82101 pfeckstrin homology-tfke domain, family 2.5 
gb:RC1^N0073-2604Q0411-g09 NN0073 Homo2.5 
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403356 NA ENSP0Q0W251525^HypothetIcaJ protein Kl 2.5 

404983 ENSP00000252242*J<erafin, type II cytosk 2.5 

418282 AA215535 Hs.98133 ESTs 2.5 

427409 AW467143 Hs.135411 actin related protein 2.5 

5 431808 AF186114 Hs370737 tumor necrosis factor (Qgand) superfami 2.5 

443367 AW071349 Hs.215937 ESTs 15 

421246 AW582962 Hs.102897 CGM7 protein 2.5 

439217 AF086041 Hs.42975 ESTs • 15 

400925 Target Exon 2.5 

10 404552 NA ENSP00000220388*:ZINC FINGER TRANSCRIPTS 

417168 AL1331 17 Hs.81376 Homo sapiens fnRNA;cONADKFZp5fi6L1 121 (f 15 

418841 NNL002332 Hs.89137 low density Opoprotein-related protein 2.5 

426853 U32974 Hs.172777 bacuiovlrai IAP repeat-containing 4 2.5 

427738 NM.000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 2.5 

15 457384 AA501760 Hs.15806 Homo saptens mRNA; cONA DKFZp434H2019 (12.5 

447128 AI271898 Hs.164866 cydinK 2.5 

454693 AW813428 gb:MR3^0192-010200-21O<05ST0192 Homo 2.5 

434657 AA641876 Hs.191840 ESTs 15 

402077 NA Target Exon 2.5 

20 400289 X07820 Hs.2258 matrix metafloproteinase 10 (MMP10; sir 2.5 

. 409723 AW885757 Hs.257862 ESTs 15 

447020 T27308 Hs.16986 hypothefical protein RJ11046 2.5 

455068 AI807894 Hs.47274 Homo sapiens mRNA; cONA DKFZp564B176 (fr 2.5 

431232 A1024353 Hs.131755 hypothetical protein FU 14298 15 

25 408938 AA059013 Hs.22607 ESTs 2.5 

411571 AA122393 Hs.70811 hypothefical protein FU2051 6 2.5 

426504 AW162919 Hs. 170160 RAB2, member RAS oncogene femffy-fike 2.5 

428248 AI126772 Hs.40479 ESTs 2.5 

408813 AI560090 Hs.48295 RNA heficase family 15 

30 423504 N80077 Hs.24792 chromosome 12 open reading frame 5 15 

425441 AA449644 Hs.193063 Homo saptens cONA FU14201 fis, clone NT 15 

443066 AW297921 Hs.255703 ESTs 2.5 

443556 AA256769 Hs.94949 methylmalonyKJoA epimerase 15 

428943 AW088180 Hs.37636 ESTs, Weakly similar to K1AA1 392 protein 2.5 

35 425320 U29344 Hs.83190 fatty acid synthase 2.5 

430388 AA356923 Hs.240770 nuctearcap binding protein subuntt 2, 2 15 

423242 AL039402 Hs.125783 DEME-6 protein 2.5 

416241 N52639 Hs.32683 ESTs 2.5 

440244 A1743977 Hs.205144 ESTs 15 

40 409239 AA740875 Hs.44307 ESTs, Moderately similar to 138022 hypot 2.5 

452464 AW500507 Hs.192619 KIAA1 600 protein 15 

410718 A1920783 Hs.191435 ESTs 15 ( 

408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-oeO gr 15 

445150 AI446747 Hs.338704 olfactory receptor, family 7, subfamily 2.5 

45 407756 AA1 16021 Hs.38260 ubiquitin specific protease 18 2.5 

407633 NNLGQ7069 Hs.37189 similar to rat HREV1 07 15 

449754 H00820 Hs.30977 ESTs, Weakly similar to B34087 hypothetl 2.5 

419316 AA236255 Hs.298419 ESTs 15 

429118 H2Q669 Hs.35406 ESTs, Highly slmflar to unnamed protein 25 

50 440331 AL046412 Hs^02151 ESTs 2.5 

449344 AI640355 Hs.312691 ESTs 2.5 

459006 AW298631 Hs.27721 WoiWirschhom syndrome candidate 1-lik 2.5 

423165 A1937547 Hs.124915 hypothefical protein MGC2601 15 

417337 AW837349 gb:QV2-LT0038-270300-108-d12 LT0038 Homo 15 

55 438290 AA843719 Hs.122341 ESTs 2.5 

406414 C50005a6*:gil124941|sp|P18614|iTA1„RAT I 15 

424498 AB033043 Hs.149377 hypothefical protein DKFZp761L0424 15 

443464 BE548446 Hs.5167 Homo sapiens mRNA; cONA DKFZp434F152 (fr 2.5 

424856 AA347746 Hs.9521 ESTs, Weakly similar to ZN43JWMAN ZINC 15 

60 440304 BE159984 Hs.125395 ESTs 2.5 

409045 AA635062 Hs.50094 Homo sapiens mRNA; cONA DKFZp434O0515 (t2JS 

422648 086983 Hs. 118893 Melanoma associated gene 2.5 

428819 AL135623 Hs.193914 WAA0575 gene product 2.5 

412520 AA442324 Hs.795 H2A histone famiry, member O 15 

65 430602 D13752 Hs.184927 cytochrome P450, subfamily X1B {steroid 15 

408031 AA081395 Hs.42173 Homo sapiens cONA FU10366 to, done NT 15 

403133 Target Exon 15 
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413189 BE070231 gb:QV4^T0407-2601<XM)87-f12 BT0407 Homo ^5 

400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19, parfia £5 

435509 AI458679 Hs.181915 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs 2.5 
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TABLE 19A 



Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey. Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accessions 

407647 1007366J AW860158 AW862385 AW860159AW862386AW862341 AW821869AW821893AW062660AW062656 
20 407980 103087 J MQ46309 AI263500 AA046397 

408254 1049348 1 AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424 
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390 
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098 
25 AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807168 AW807501 

AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800 
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW807514 AW607142 AW807232 AW807379 AW8071 14 AW807518 AW807199 AW80721 1 
AW807498 AW807086 AW807492 AW80721 8 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784 
30 AW807037 AW807128 AW807080 AW807118 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160 

AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103 
AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW80741 1 AW807088 AW845865 
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521 
A W807488 AW807385 AW807355 AW807223 AW807155 
3 5 409163 110418 1 AA065081 M075017 AA084791 AA071015AA081560AA071459 AA545727AA083100AA085366 AA115845AA075457 
AA064704 AA082878 AA075742 AA069162 
409695 1 14876 1 AA296961 AA296889 AA076945 AA077528 AA077497 
410534 1207247J AW905138 AW753008 R13818 243519 
410672 1214882.1 AW794600AW794730 
40 410784 1221005.1 AW803201 8E079700 BE062940 

410785 1221055J AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 
410835 1223785.1 AW806906 AW806915 AW866460 AW866475 AW866462 AW866448 AW866372 AW866604 
411050 1230330.1 AW814902 BE156656 8E156667 BE156590 BE156441 BE156447 
411086 1231500.1 BE070800 AW875226 BE1491 15 
45 411093 1231970.1 BE067650 AW817053 

411111 1232669J AW818127AW818161 R09719 
411171 1234393.1 AW820260 AW820332 R94406 
411337 1239217J AW837349AW837355 AW88Z717 
411514 1248638J AW850178AW850233 AW850445 AW850446 
50 411670 1253680J AW856552AW861101 AW856574AW861099AW861100AW856573AW856576AW856562 
411905 1265181J BE265067 BE264978 AW875420 
412102 1277395.1 H56435 H56572 AW892929 
412209 1283610.1 AW901456 AW901450 AW901441 
412248 1285000.1 BE176480 AW903298 AW903313 
55 413043 1346556J BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678 

413111 1349546.1 BE065837 BE065805 BE065799 BE065818 BE065839 BE065831 BE065894 BE065789 BE065792 
413189 1352723.1 BEO70231 BE070229 BE070255 

413221 1353887.1 BE161151 BE162495 BE161002 BE072205 BE160989 BE162482 
413499 1373910.1 BE144884 H97942 
60 413708 1384140J BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158685 
414210 1426051.1 BE383592BE261671 
414596 1465004.1 BE386870 Z41986 H08501 
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414605 1465790.-1 BE390440 

415747 155189 1 AA381209 AA381245M167683 

416173 1574973J R52782 R17313 H24192 R19876 

417742 1696282J R64719 Z44680 R12451 

5 417974 171237.1 AA210765 T95700 H94407 

418636 177402 J AW749855 AA225995 AW750208 AW750206 

419536 185688 J AA603305AA244095AA244183 

420854 197072.1 AW296927AI684514AI263168AA281079 

422156 212379 1 N34524 AA305071 AW954803 AA502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643 

1 0 AW835572 AW385512 AI334966 W32951 H62656 H53902 R88904 AW835732 

422996 223666 1 BE091069 BE091123 AA319959 

423833 232451J AW503329N46610AA331571 

423841 232507 1 AW753967 AA370795 AA331630 AW962550 

423945 233566 1 AA410943 AW946953 AA334202 AA332882 

15 425201 247933J AA352111 AW962247AA429695 

426650 270283 J AA382814 AA402411 AA412355 

426878 273265 J BE069341 AW748403 AL044891 AI908240 AA393080 

430264 315008 J AM70519 BE303010 BE302954 BE384120 

430785 323486 J Z30201 AA486132 T72025 

20 431676 336411J AI685464AW971336AA513587AA525142 

433687 373061.1 M743991 AA604852AW272737 

434338 383982J AW754311 AA630185 AW803285 

434469 387447 1 AA634805 C18732 AA729161 AA729860 

435447 406400 J AI872932AA682306BE220163 W88695TB1307H91447 

25 437152 43386.1 ALQ50027 8E089G51 

437854 44418.1 AL1 19723 AL1 1 9874 A1909018 U50537 

439031 46798J AF075079 H48601 H48795 

439255 470321.1 BE164500 AA832198 BE164502 

444910 624951.1 AI201849 BE069007 AW946544 

30 445432 63943.1 AV653771 BE089370 

446922 69865 1 BE175605 Z43529 F06610 BE175602 AV661027 

447197 711623.1 R36075 AI366546 R36167 

448420 76273.1 BE623004 AA380669 BE263627 BE246433 

448516 766241 1 AW898595 AW898588 AW898590 AW898663 AW898592 A1525093 

35 450522 837264.1 AI698839 A1909260 A1909259 

450736 844652.1 AW970060 AI732366 AI792313 AW839644 

451024 85565.1 AA442176 AA259181 

451067 85759 1 BE 1 72186 AA059279 AA020815 AA013437 

451340 86640 1 AW936273AW340350AA017208 

40 452542 921410.1 AW812256AW812257AI906423 AI906422 

452564 92227.1 AA026777 N50065 R09961 N54721 

453472 968371.1 AL037925 AL037931 AL037957 

454307 1106070.1 AW855717 AW362452 AW362443 

454359 1130674.1 N71277AW390764 

45 454545 1223779 1 AW806899 AW866451 AW866393 AW866297 AW817869 

454693 1229132.1 AW813428 AW813444 AW813367 AW813368 AW813429 AW813424 

454714 1230493.1 AW815098 BE154843 BE154831 

455047 1250536.1 AW852530AW852527AW852526 

455092 1252971J BE152428 AW855572 AW855607 

50 455097 1253130 1 AW855802 AW855794 AW855797 AW855806 AW855796 AW855808 AW855793 AW855807 

455100 1253334.1 BE160198 AW935898 T1 1520 AW935930 AW856073 AW861034 

455431 1289854 1 AW938484 BEO0 1245 BE001 190 

455511 1321229J BE144762 AW979091 

455609 1337548 1 BEQ11668 BE011689 BE011627 BE011679 BE011699 BE011678 BE011696 BE011675 BE011622 BE011635 

5 5 455651 1348732J BE064962 BE064979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804 
BE064816 BE064850 BE064806 BE064796 BE064818 BE064975 BE064819 BE064810 BE064668 BE065059 

455685 1350393 1 BE066976 BE066928 BE066927 

455700 1351264.1 BE068115 BE068104 BE068102 BE068096 BE068103 BE068154 BE068198 

455708 1352232.1 BE069326 BE069290 BE069352 

60 455732 1353874 1 BE080908 BE072258 8E072190 BE072236 

455838 1374605.1 BE145808 BE145807 BE181883 

455935 1384144J BE1586B7 BE158688 

455945 1385588.1 BE1 60636 BE 160606 BE1 60703 

456207 165078.-1 AA193450 

65 456482 192289 J AA485224AA287308AA258121 

458094 . 47311 1 ARJ86325 W72956 W73221 AA219112 

458673 679507.1 N99626 AI302701 
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TABLE 19B 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey. Unique number corresponding to an Eos probeset 

Refc Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. 'Dunham I. et ah" refers to the publication 

entitled The DMA sequence of human chromosome 22." Dunham L et at, Nature (1999) 402:489495. 
Strand: Indicates DMA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 


Strand 


NLposition 


400555 9801191 


Minus 


134694-134817 


400608 9887666 


Minus 


96756-97558 


400610 9887671 


Minus 


117606-117928,124040-124147 


400925 7651921 


Plus 


38183-38391,43900-44086 


401045 8117619 


Plus 


90044-90184,91111-91345 


401049 7232177 


Plus 


149157-150692 


401093 8516137 


Minus 


22335-23166 


401256 9796573 


Minus 


4548245620 


401283 9800093 


Minus 


47256-47456 


401326 9212516 


Minus 


226246-227505 


401418 7452889 


Minus 


124865-125075 


401451 6634068 


Minus 


119926-121272 


401458 9187886 


Plus 


76485-77597 


401497 7381770 


Plus 


92607-92813 


401508 7534110 


Minus 


110779-110983 


401575 7229804 


Minus 


76253-76364 


401747 9789672 


Minus 


118596-118816,119119-119244,119609-119761.120422-120990,130161-130381.130468-13 






131258,131866-131932,132451-132575,133580-134011 


401781 7249190 


Minus 


83215^35,83531^6,8374W3901,84237-84393.84955^037,8629(«6814 


401785 7249190 


Minus 


165776-165996.166189-166314J6640S-166569.167112-167268.167387-167469.168634-168942 


401793 7263888 


Minus 


102945-103083 


401987 4406829 


Minus 


72893-73021.76938-77049 


402077 8117414 


Plus 


65014-65195 


402109 8131678 


Minus 


171722-171859,173197-173303 


402184 8576001 


Minus 


112844-112986,113505-113636 


402376 9625329 


Minus 


21753-22385 


402421 9796341 


Minus 


46609-46662.46758-4681 1 .86293-86346,89776-89829.90048-90101 .102817-1 02924 


402578 9884928 


Pius 


66350-66496 


402606 9909429 


Minus 


81747-82094 


402696 7328818 


Minus 


23600-23731 


402765 9367757 


Plus 


109588-109726 


402800 6010175 


Plus 


43921-44049,46181-46273 


402820 6456853 


Minus 


82274-82443 


402892 8086844 


Minus 


194384-194645 


403133 7331427 


Plus 


38314-38634 


403356 8569930 


Plus 


92839-93036 


403388 9438331 


Plus 


112733-113001,114599-114735 


403426 9719529 


Minus 


157156-158183 


403585 8101208 


Minus 


131266-131769 


403593 6862650 


Minus 


62554-62712,69449-69602 


403637 8671936 


Minus 


142647-142771.145531-145762 


403639 8671948 


Plus 


113234-113326,115186-115287.119649-119786 


403677 7331517 


Minus 


55008-55083.62860-63051 


403775 7770580 


Minus 


102247-102326,103095-103148 


403943 7711864 


Plus 


100742-100904.101322-101503 
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404091 7684554 


Minus 


82121-83229 


404097 7770701 


Plus 


55512-55781 


404142 9856692 


Minus 


80316-80459 


404253 9367202 


Minus 


55675-66055 


404274 9885189 


Plus 


104127-104318 


404285 2326514 


Plus 


32282-32416 


404360 9858450 


Minus 


122873-122966,151324-151469,153093-153253 


404440 7528051 


Plus 


80430-31581 


404443 7579073 


Minus 


8719847441 


404552 7243881 


Plus 


19854-20010 


404561 9795980 


Minus 


69039-70100 


404580 6539738 


Minus 


240588-241589 


404721 9856648 


Minus 


173763-174294 


404826 6572184 


Plus 


47726-48046 


404983 4432779 


Minus 


51178-51374.52000-52173 


405037 7543748 


Minus 


127374-127578 


405041 7547195 


Plus 


121230-121714 


405095 8072599 


Plus 


138877-139066 


405153 9965565 


Minus 


175317-175500 


405196 7230083 


Minus 


135716-135851 


405232 7249042 


Plus 


125904-126063 


405248 7259728 


Plus 


637-777 


405336 6094635 


Plus 


33267-33563 


405394 6624123 


Minus 


3190(W2373 


405460 7684569 


Minus 


52223^2389 


405494 8050952 


Minus 


70284-70518 


405547 1054740 


Plus 


124361-124520,124914-125050 


405609 5757553 


Minus 


4281443010.4358343783,4486345033,46429465^ 






52918,5546^55601,57111^7307,58169-58296,60215-60332,61482-61727 


405638 6289229 


Plus 


199260-199372.199826-199929 


405654 4895155 


Minus 


53624-53759 


405718 9795467 


Plus 


113080-113266 


405822 6273498 


Minus 


154660-154974,155203-155379 


405848 7651809 


Minus 


28135-28244 


405873 6758747 


Minus 


32129-32764 


405906 7705124 


Minus 


10835-11059 


405917 7712162 


Minus 


106829-107213 


405925 6758795 


Plus 


129935-130282 


405953 7960374 


Minus 


65101-65574 


406069 9117732 


Plus 


68880-69374 


406151 7144806 


Minus 


94087-94285 


406153 9929734 


Minus 


12902-13069 


406182 5923650 


Minus 


28256-28935 


406271 7534217 


Plus 


36179-36692 


406291 5686274 


Plus 


9562-9867 


406348 9255985 


Minus 


71754-71944 


406414 9256407 


Plus 


4959349850 


406446 9454509 


Minus 


116424-116527.118721-118859.121187-121364 


406504 7711360 


Minus 


107068-107277 


406554 7711566 


Pius 


106956-107121 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
SURFACE PROTEINS 

Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
96 th percentile value amongst die 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g. ig, fh3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 



15 



20 



Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UrogeneiD: Unfgene number 

Pred.ProLDomalns: Predicted Protein Domains 

Unjgene Title; Unlgene gene trtte 

R1: Ratio of 93* percentile of tumor to 85* percentile of normal body tissue 



25 Pkey ExAccn UnigeneiD PredJVotDomalns 



UnlgeneTttfe 



408591 
400291 
449746 

30 407277 
400292 
424735 
426878 
428848 

35 407178 
408000 
427585 
429441 
450375 

40 420931 
422109 
445730 
451110 
400297 

45 420813 
452744 
424634 
420757 



50 



55 



AF015224 Hs.46452 

AA401369 Hs.190721 

AI668594 Hs.176588 

AW170035 Hs.326736 

AA250737 Hs.72472 

U31875 Hs.272499 
BE069341 

NM_000230Hs.194236 

AA195651 Hs.104106 

111690 Hs.620 

D31152 Hs.179729 

AJ224172 Hs.204096 

AA009647 Hs.8850 

AF044197 Hs.100431 

S73265 Hs.1473 



AI624342 

A1955040 

AJ127076 

X51501 

A1267652 



Hs.170042 

Hs.265398 

Hs.334473 

Hs.99949 

Hs.30504 



NM_003613Hs.1514Q7 
X78592 Hs.99915 



447350 
456207 
431448 
427217 
456938 
435496 
402578 
453160 
422505 
444342 



Hs.2533 
Hs.172634 



AJ905687 
AI375572 
AA193450 

AL137517 Hs.334473 
AA399272 Hs.144341 
X52509 Hs.161640 
AW840171 Hs.265398 

AI263307 Hs.239884 
AL120862 Hs.124165 
NMJ)14398Hs.10887 



SS,Utenxjtobin t SS.Uteroglobin 
TM 

,SS,p450 
TM 

death .ZUS.TM^ctivin.recp.pkinase, 
.SS.TM 
TM 

SS,LeptIn,SS.Leptin, 
f SS.DDiydroorotase, 

Plectin_repeatSH3,spectrin l SS.P!ectIn^ 
SS.CIq.Collagen.SS.CIq.CoHagen, 
.SS.Uteroglobln. 

< SS,TM,dislntegrin f PepJ/112B jxopep.Repro a disintegrln and metaJloprotelnase doma 



1 

ESTs 

ESTs. Weakly similar to CP4Y .HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

short-chain alcohol dehydrogenase family 
gb:QV3^T0381-27010(M)73^08BT0381 Homo 
leptin (murine obesity homobg) 
ESTs 

bullous pemphigoid antigen 1 (23Q7240kD) 
collagen, type X, alpha 1 (Schmld metaph 
Bpophifin B (uteroglobin family member) 



SS.iL8.SS 
SS.Bombesin.SS 
.SS.TM.Cation.efflux 
SS 
TM 

SS SS 

,SS.TM,GNS1.SUR4 l cNMP_blnding,Rna 
fg,tspJ,SS,AAA 

honnonejecAndrogenjecep,zf-C4 l 
SS 

.pidnasa, 

,SSJM,p450.p450 
TM 
SS 



small Inducible cytokine B subfamily (Cy 
gastrin-reieaslng peptide 
ESTs 

ESTs. Weakly similar to transformafiofK 
hypoUietical protein DKFZp58401278 
prolactirvinduced protein 
Homo sapiens mRNA; cDNA DKF2p434E082 (fr 
cartilage intermediate layer protein, nu 
androgen receptor (dihydrotBstosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gbzr40e07.r1 SoaiesJJhHMPu_S1 Homosapl 
hypothetical protein DKFZp56401278 
ESTs 



,SS,lT^amin6trarLL2,Caa7ienTLC_term,cad tyrosine aminotransferase 

SS ESTs, Weakly similar to transformation-** 

SS.p450.SS.TM.p450 C1001134rgi)2117372lpir|Il65981 fetty ac 

SS H2B histone family, member L 

SS programmed cell death 9 (PDCD9) 

Ump.SS.TM.Ump. similar to lysosomo-assodated membrane 

304 



R1 

168.6 

73.2 

65.7 

57.6 

55.9 

53.8 

50.3 

40.8 

39.3 

37.3 

35.2 

30.0 

25.7 

25.2 

24.8 

24.1 

24.0 

23.8 

22.8 

22.6 

21.7 

21.7 

20.3 

192 

18.3 

18.2 

182 

18.1 

17.9 

17.8 

173 

17.7 

17.5 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



449765 N92293 Hs.206832 


SS 


ESTs, Moderately similar to ALUS JHUMAN A 


17.3 


428227 AA321649 Hs.2248 


SSJLB, 


small inducible cytokine subfamily B (CX 


17.0 


425692 D90041 Hs.155956 


t SSAceiyttransf2, 


N-acetyltransferase 1 (arylamine N-acety 


16.7 


424001 W67883 Hs.137476 


.pkinase, 


paternally expressed 10 


16.5 


448595 AB014544 Hs.21572 


LRRCT,LRR < SS J LRRCT,SQrine_carbpept 


K1AA0544 gene product 


16.3 


449448 D60730 Hs.57471 


SS 


ESTs 


16.2 


418007 M13509 Hs.83169 


SS.hemopexIn.Peptkiase.MIO.SS.Peptidase.matrix metallopnolelnase 1 (MMP1; inters 


15.7 


418994 AA296520 Hs.89546 


SS.!ectii\_csushl,EGF l SS,EGF,tectin - csu 


seiecfin E (endothelial adhesion molecul 


15.5 


453596 AA441S38 Hs.62905 


SS 


hypothetical protein FU14834 


15.5 


452401 NH.007115HS.29352 


,SS,CUB.Xilnk, 


tumor necrosis factor, alpha-induced pro 


15.0 


446591 H44186 Hs.15456 


PDZ.SS 


PDZ domain containing 1 


14.9 


419296 AA236115 Hs.120785 


SS 


ESTs 


14.8 


452838 U65011 Hs.30743 


SS.SS 


preferentially expressed antigen In mela 


14.7 


422805 AA436989 Hs.121017 


hlstone.SS.hlstone.histone 


H2A hlstone family, member A 


14.3 


448390 AL035414 Hs.21068 


SS 


hypothetical protein 


14.2 


447342 AI199268 Hs.19322 


.SS.Hpocatin 


Homo sapiens, Similar to RIKEN cDNA 2010 


14.2 


411869 W20027 Hs.23439 


,SS,Peptidase_M1, 


ESTs 


13.9 


443709 At082692 Hs.134662 


,SS,TM,SNF 


ESTs 


13.7 


459587 AA031956 


.ssjjm, 


gb:zk15e04.s1 SoaresjjregnanLuterusJW 


13.7 


442580 AI733682 Hs.130239 


SS 


ESTs 


13.5 


400289 X07820 Hs.2258 


hemopexinPeptidase_M10,SSPeptidas8j^10malrix metaOoproteinase 10 (MMP10; str 


13.5 


411598 BE336654 Hs.70937 


hfetorte.SS.histone.histone 


H3 histone family, member A 


13.3 


415263 AA948033 Hs.1 30853 


,SS,histone,histone,linkerJ)istone 


ESTs 


13.2 


433805 AA706910 Hs.1 12742 


,SS.RlbasomaU-7Ae, 


ESTs 


13.1 


407276 AI951118 Hs.326736 


TM 


Homo sapiens breast cancer antigen NY-BR 


13.1 


443348 AW873596 Hs.1 82278 


,SS,DENN 


calmodulin 2 (phosphorylase kinase, delt 


13.0 


421037 AI684808 Hs.1 97653 


SS 


programmed cell death 9 (PDC09) 


12.9 


424086 A1351010 Hs.102267 


.SS.LysyLoxWase 


iysyi oxidase 


12.8 


400295 W72838 Hs.2533 


SS 


aldehyde dehydrogenase 9 family, member 


12.7 


452461 N78223 Hs.108106 


,SS.G9a,PHD, 


transcription factor 


12.5 


427365 AI873274 Hs.190721 


TM 


ESTs 


12.4 


433365 AF026944 Hs.293797 


.SSJPR 


ESTs 


12.3 


409269 AA576953 Hs.22972 


SSJM,UPF0016,SS,TKUPF0016 


hypothetical protein FU13352 


12.0 


432596 AJ224741 Hs.278461 


SS.EGF.vwa.SS.TM.vwa, 


roatrifin 3 


11.9 


408771 AW732573 Hs.47584 


TM.K^telra.lorLlrans.SS.TM.ICtetra^onJ 


potassium voltage-gated channel, delayed 


11.9 


432912 BE007371 Hs.200313 


,SS,TM,Folate carrier 


ESTs 


11.9 


447033 AI357412 Hs.157601 


SS 


ESTs 


11.8 


421155 H87879 Hs.102267 


SS,LysyLoxldase,Aldose_epim,Epimerase,S lysyi oxidase 


11.8 


424905 NMJ>02497Hs.153704 


pklnase 1 SS,TM,pldnase,poJyprenyLsyrrt, 


NIMA (never in mitosis gene a)-re!ated k 


11.7 


425398 AL049689 Hs.156369 


SS 


hypothetical protein similar to tenasdn 


11.6 


438167 R28363 Hs.24286 


l SS 1 TM,7tnLl l p450 1 rrm 
.pkinase, 


ESTs 


11.5 


459583 AI907673 


gb:IL-8T152-080399404 BT152 Homo saplen 


11.5 


423945 AA410943 


deam,ZU5,TM l Acfivin_recp,pkinase, 


gb:zi32h03.ii Soares ovary tumor NbHOTH 


11.4 


439820 AL360204 Hs.283853 


SS 


Homo sapiens mRNA fuD length insert cDN 


11.4 


402606 


SS 


NivL024626:Homo sapiens hypothetical prot 


11.3 


445263 H57646 Hs.42586 


.SSAcyitransferase, 


KIAA1560 protein 


112 


430217 N47863 Hs.336901 


> SS ( R^IAJ3oLAR^^U^.RtbosomaLS24e ( ribos«n S24 


11.1 


447164 AF026941 Hs.17518 


JM.IBR 


Homo sapiens dg5 mRNA, partial sequence 


11.1 


431385 BE178536 Hs.1 1090 


,SSJM 


membrane-spanning 4-domains, subfamily A 


11.1 


423887 AL080207 Hs.134585 


,SS,TM l BRCT,ank I ABC_tran l ABCJran 


DKFZP434G232 protein 


10.9 


415385 R17798 HsJ535 


.SSJorkJiead, 


COBW-fike protein 


10.9 


425704 U79293 Hs.159264 


SS 


Human clone 23948 mRNA sequence 


10.7 


429859 NMJ)07050Hs.225952 


,SS < TM,Yj)hospnatase,MAM,fh3, 


protein tyrosine phosphatase, receptor t 


10.4 


425523 AB007948 Hs.158244 


l SS,larninirLB l laminin_EGF,taminin - Nterrn 


KIAA0479 protein 


10.3 


428368 BE440042 Hs.83326 


SS.Peptidase jmherrwpexin.SS.Peptldasejnatnx melailoprotelnase 3 (stromelysin 


10.3 


418912 NhL000685Hs.89472 


SSJMJtnU.SSWtmJ. 


angiotensin receptor 1 


10.3 


422026 U80736 Hs.1 10826 


SS 


trinucleotide repeat containing 9 


10.3 


451952 AL120173 Hs.301663 


.SS.pkinase, 


ESTs 


10.3 


438199 AW016531 Hs.122147 


t SS,ArfGap, 


ESTs 


102 


400608 


SS.TM.SS.TM 


C10001899:gl|7508633lplr||T25392 hypothe 


10.1 


413472 BE242870 Hs.75379 


SS 


solute carrier family 1 (glial high affi 


10.0 


432374 W68815 Hs.301885 


SS 


Homo sapiens cDNA FU1 1346 fts, clone PL 


9.9 


402408 HA 


,SS,carb_anhydrase 


NM_030920*:Homo sapiens hypotheticai pro 


9.8 


445537 AJ245671 Hs.12844 


,SS,TM/as 


EGF-lfo-aomain, multiple 6 (EGFL6) 


9.7 


451621 AI879148 Hs.26770 


SS.fipocaltaJipocalin, 


fatty acid binding protein 7, brain 


9.6 


405654 NA 


BTB.SS 


C12001521:gi|7513934|plrIT31081 cca3 pr 


9.6 


434988 AJ418055 Hs.161160 


SS 


ESTs 


9.6 
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416220 N49776 Hs.170994 


,SSJM 


hypothetical protein MGC10946 


9.5 


431808 M30703 Hs.270833 


SSJMEGF^S 


arnphiregufin (schwannoma-derived growth 


9.5 


414142 AW368397 Hs.150042 


.SS.UDPGT 


Homo sapiens cDNA FU14438 frs. clone HE 


9.4 


400298 AA032279 Hs.61635 


TM 


six transmembrane epithelial antigen of 


9.4 


418601 AA279490 Hs.86368 


SS.TM.calreticuiln.SSJM.calreticulin, 


calmegin 


9.4 


415539 AI733881 Hs.72472 


deafli,ZU5,TM^ciivin_fBcp,pkInas©, 


BMP-R1B 


9.4 


421451 AA291377 Hs.50831 


TM 


ESTs 


9.3 


429432 AI678059 Hs.202676 


SS 


synaptonemaJ complex protein 2 


9.3 


442441 AI820662 Hs.129598 


ss 


ESTs 


9.1 


426429 X73114 Hs. 169849 


,SS/TM,fn3,la 


myosto-binding protein C. slow-type 


9.1 


406687 M31126 Hs.272620 


SS.Peptklase,M10 < hemopexln,SS,Pep«dase_matrix metatloprotelnase 11 (MMP11; stro 


9.1 


448693 AW004854 Hs.228320 


SS 


hypothetical protein FU23537 


9.1 


419948 AB041Q35 Hs.93847 


Fernc^ductTM,Ferrfcjeduct 


NM_016931:Homo sapiens NADPH oxidase 4 ( 


9.1 


426214 H59846 Hs.128355 


SS 


ESTs, Moderately similar to ALU7.HUMAN A 


9.0 


427718 A1798680 Hs.25933 


.SS.TM.histone.Seri ,hktone,sugarjr 


ESTs 


8.9 


414812 X72755 Hs.77367 


SS.lLB.SS.tL8 


monokine Induced by gamma Interferon 


8.8 


400285 NA 


,TM,ABCJran.ABCjriernbrane. 


Eos Control 


8.8 


422330 D30783 Hs.115263 


SSJM,EGF,SS,TM 


eptreguBn 


8.8 


416182 NM_Q04354Hs,79069 


cycfln,SS 


cyclinG2 


8.8 


420077 AW512260 Hs.87767 


SS 


ESTs 


8.7 


452281 T93500 Hs.28792 


,SS,TGF-beta ( TGFb_propeptide, 


Homo sapiens cDNA FLM104t fo, clone PL 


8.7 


434531 AA642007 Hs.1 16369 


SS 


ESTs 


8.6 


408380 AF123050 Hs.44532 


,SS,TM,ubIquitln,7tmJ l ANF_receptor,sush 


dhtoiquitin 


8.6 


443788 AI732643 Hs.144151 


TM 


ESTs 


8.6 


411078 AI222020 Hs.182364 


SS.SS 


CocoaCrisp 


8.5 


445495 BE622641 Hs.36489 


SS.SS J ENW.LLVVEQ,ENTH.LLWEQ ) DNA L jnJsjBESTs, Weakly similar to 138022 hypothetl 


8.5 


433426 H69125 Hs.133525 


.SS.TM 


ESTs 


8.5 


424871 NM_004525Hs.153595 


SS.EGF.WLrwepta.ldLrecepLb.SS.TM.E 


low density lipoprotein-retated protein 


O A 

8.4 


426215 AW963419 Hs.155223 




stanntocalcln 2 


8.4 


409045 AA635062 Hs.50094 


TM 


Homo sapiens mRNA; cDNA DKFZp434O0515 (f 


8.4 


435525 A1831297 Hs.123310 


TM 


ESTs 


8.3 


409203 AA780473 Hs.687 


SS.p450.SS.p450 


cytochrome P450. subfamily IVB. polypept 


8.3 


424902 NMJ03866Hs.153687 


SS SS 


inositol potyphosphate-4-phosphatase. ty 


8.3 


431725 X65724 Hs^839 


Ss!cys_knotSS 


Norrie disease (pseudogiioma) 


8.3 


418092 R45154 Hs.106604 


1 dealh f ZU5,pkinase^ctivia_racp 1 


ESTs 


8.3 


439840 AW449211 Hs.105445 


SS 


GDNF family receptor alpha 1 


8.2 


427811 M81057 Ha.180884 


SS,ZfUcarbOpept.PropepJ/l14 l SS.Propep_M14carboxypepfldase B1 (tissue) 


8.2 


420807 AA280627 Hs.57846 


SS.cpnIO 


ESTs 


8.2 


426320 W47595 Hs.169300 


SS.TGF-beta,TGFbj>ropeptide,SS 


transforming growth factor, beta 2 


8.2 


447078 AW885727 Hs.301570 


.SS.kazal, 


ESTs 


8.1 


415786 AW419196 Hs.257924 


SS 


hypoMcaJ protein FU13782 


8.1 


410102 AW248508 Hs.279727 


SS 


Homo sapiens cDNA RJ14035 (is, clone HE 


8.0 


404347 


SS 


Target Exon 


8.0 


433687 AA743991 


TM 


gb:ny57g01.s1 NCI_CGAPJ>r18 Homo sapiens 


8.0 


421373 AA808229 Hs.167771 


.SS.1MPDHJC.IMPOHJI.CBS 


ESTs 


8.0 


422634 NM_016010Hs.1 18821 


SS 


CG1-62 protein 


7.9 


453310 X70697 Hs.553 


TM.SNF.SSJM.SNF. 


solute carrier family 6 (neurotransmitte 


7.9 


'435957 N39015 Hs.190368 


.SS.TM 


ESTs 


7,8 


407771 AL138272 Hs.62713 


,TM,cpn60„TCP1,Sefna, 


ESTs 


7,8 


443646 AIQ85198 Hs.164226 


JSPN.vwc,tspJ.EGF,thtored, 


ESTs 


7.8 


446142 AI754693 Hs.145968 


,TM,cadherin,CadheriruCJerm, 


ESTs 


77 


444649 AW207523 Hs.197628 


.SS.nm, 


ESTs 


7.6 


435147 AL133731 Hs.4774 


,TM,SDF,UPARJ-Y6, 


Homo sapiens mRNA; cDNA DKFZp761C17i2 (f 


7.6 


439138 A1742605 Hs. 193696 


TM 


ESTs 


7.6 


429220 AW207206 Hs.136319 


SS 


ESTs 


7.6 


428804 AK000713 Hs.193736 


.SS.UDPGT 


hypothetical protein FU20705 


7.5 


453511 AID31224 Hs.33102 


SS.SS 


transcription factor AP-2 beta (acQvati 


7.5 


439809 R41396 Hs. 101 774 


SS 


hypometical protein FU 23045 


7.5 


414869 AA157291 Hs.21479 


SS 


ubinucleln 1 


7.5 


416276 U41060 Hs.79136 


SS.TM.TM 


UV-1 protein, estrogen regulated 


7.5 


452862 AW378065 Hs.8687 


,SS < PepJ412Bj>rapep,Reprolysin,tspJ, 


ESTs 


7.4 


452926 AI742170 Hs.31297 


.SS.TM 


duodenal cytochrome b 


7.4 


453331 A1240665 Hs.8895 


,SS.TM.dislntegrinPepJ^12Bj)ropep,RepfD ESTs 


7.3 


420802 U22376 Hs.1334 


SSMmyb-DNA-bincflng 


v-myb avian myetobiastosb viral oncogen 


7.3 


450603 R43646 Hs.12422 


SS 


ESTs 


7.2 


422867 L32137 Hs.1584 


SSfGF,tspJ.SS.E2F_TDP, 


cartilage otlgomeric matrix protein (COM 


7.2 


418004 U37519 Hs*7539 


SS.aldedh,SS,aldedh, 


aldehyde dehydrogenase 3 family, member 


7.2 
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417275 
432731 
442818 
407366 
427427 
410785 
401045 



A1908165 Hs.169946 

H39960 Hs.288467 

AI198719 Hs.176376 

U92649 Hs.64311 

AB029496 Hil59729 

AA102670 Hs.70725 

W87707 Hs.82065 

X63578 Hs.295449 

R31178 Hs.287820 

AK001741 Hs.6739 
AF026942 

AF077345 Hs. 177936 
AW803341 



C11OO1883*:giI6753278|refJNPj033938.1| c 



SS.GATA, 
.SS.LRR 
SS 

,TM,dlslntegrin,Reprolysin l 
SS,ig,Sema,SS,Sema J efhand 
SS.TM.SS.TM 

SS,efhand,SS ( emand,ras 
.SS.frafn1.fn2,fo2 f fn1 
WD40.SS 
.TM.1BR 

SS 

ICE_p20,SS ( lCEj)10,ICE4)20 l {CE_p10,ICE_p 

6J 



GATA-blndlng protein 3 {T-cell receptor 
Homo sapiens cDNA FU 12280 Hs, done MA 
ESTs 

a disintegrin and metalioproteinase doma 
semaphorin sem2 

gamma-aminobutyric acid (GABA) A recepto 



parvalbumln 
fibronectin 1 

hypothetical protein RJ10879 
gb:Homo sapiens dg33 mRNA, partial sequ 
ESTs 

gb:O-UM0079^903(XW5(W)03 UM0079 Homo 



7.1 
7.1 
7.1 
7.1 
7.0 
7.0 
7.0 
7.0 
6.9 
6.9 
6.8 
6.8 
6.7 



418986 A1123555 Hs.81796 


.SS.Reprolysin.tspJ, 
.mEPHJbd,pkinase,SAM,fn3, 


ESTs 


6.7 


442082 R41823 Hs.7413 


ESTs; calsyntenin-2 


6.7 


442861 AA243837 Hs.57787 


SS 


ESTs 


6.6 


418836 AI655499 Hs.161712 


.TM.ActMrurecp.pWnase.death.ZUS, 


ESTs 


6.6 


422060 R20893 Hs.325823 


,SS,TM,C036 


ESTs. Moderately similar to ALU5_HUMAN A 


6.6 


444381 BE387335 Hs.283713 


,SS.rnito_carr 


ESTs, Weakly similar to S64054 hypothec 


6.6 


404091 NA 


JM.7tnu3.AN preceptor, 


Target Exon 


6.6 


417866 AW067903 Hs.82772 


SS,CoiIagen.COLFI,TSPN ( SS ( TSPN 


colagen, type XI, alpha 1 


6.6 


428819 AL135623 Hs.193914 


SS.SS 


K3AA0575 gene product 


6.5 


410275 U85658 Hs.61796 


,SS,RlbosomaLS4e 


transcription factor AP-2 gamma (acfivat 


6.4 


425236 AW067800 Hs.155223 


SS 


stanniocalcin 2 


6.2 


415669 NM_005025Hs.78589 


,SS,serpin ( 


serine (or cysteine) proteinase inhibtto 


6.2 


416319 AI815601 Hs.79197 


SS.TMJg.SSJM 


CD83 antigen (activated B lymphocytes, i 


6.2 


412140 AA219691 Hs.73625 


,SS,kinesin, 


RAB6 interacting. kinesin-&ke (rabkines 


6.2 


442942 AW167087 Hs.131562 


.SS.lg.Sema.pkinase, 


ESTs 


6.2 


446163 AA026880 Hs.25252 


.SS.TM.fn3, 


prolactin receptor 


6.1 


443162 T49951 Hs.9029 


fflamentSS.fflamentfilarnent 


DKFZP434G032 protein 


6.1 


409602 W26713 Hs.256972 


.SS.TM.DAGKa.DAGKc, 


ESTs 


6.1 


428479 Y00272 Hs.184572 


.SS.pklnase.pfdnase 


ceD division cycle 2, G1 to S and G2 to 


6.1 


400300 X03363 


,SS l TM,plOTase l Recep^l % _domaln I SH2,PH^urHER2 receptor tyrosine kinase (c-erb-b2, 


6.1 


433404 T32982 Hs.102720 


SS 


ESTs 


6.1 


410079 U94362 Hs.58589 


Glyco_transf_8,SS 


glycogenh 2 


6.1 


401781 


.SS.filamentpribosyliran.filamentArmad 


Target Exon 


6.1 


447359 NWL012093HS.18268 


SS,adenyiate kinase, 


adenylate kinase 5 


6.1 


402230 NA 


.SS.TM.p450, 


Target Exon 


6.1 


427674 NM.003528Hs.2178 


Mstone,SS,histone, 


H2B his tone family, member Q 


6.1 


428398 AI249368 Hs.98558 


.SS.TM 


ESTs 


6.0 


458098 BE550224 Hs.74170 


SS 


metaltothionein 1E (functional) 


6.0 


419968 X04430 Hs.93913 


SS.IL6JL6, 


interieukin 6 (interferon, beta 2) 


6.0 


416636 N32536 Hs.42645 


.SS.TM 


solute carrier family 16 (monocarboxylic 


6.0 


419703 A1793257 Hs.128151 


,SS,zf.C2H2. 


ESTs 


5.8 


424687 J05070 Hs. 151738 


SS.Peptidase_M10 l fn2.hemopexin l SS,TM,l 


D epmatrix metalioproteinase 9 (geiatinase B 


5.8 


449679 AI823951 Hs.129700 


SS 


toQokMike 1 


5.8 


421296 NMJK)2666Hs.1Q3253 


SS 


perfllpin 


5.8 


442117 AW664964 Hs.128899 


.SS.TM 


ESTs 


5.7 


400303 AA242758 Hs.79136 


.SS.TM 


LIV-1 protein, estrogen regulated 


5.7 


419440 AB020689 Hs.90419 


SS 


WAA0882 protein 


5.7 


444858 AI199738 Hs.208275 


SS 


ESTs, Weakly similar to ALUA_HUMAN I 111 


5.7 


432239 X81334 Hs.2936 


SS l Peptidas8_M10,hemopexin l SS,Peptidase_matrix metalioproteinase 13 (coliagenase 


5.6 


440705 AA904244 Hs.1 53205 


TM 


ESTs 


5.6 


400286 NA 


SS.TM.ABCJran.ABCjnembrane.SS 


C16000922:giI7499103|pirj|T20903 hypothe 


5.6 


446466 H38026 Hs.308 


arresbn.SS 


arrestln 3, retinal (X-arrestin) 


5.5 


423201 NM.000163HS.125180 


SS.TM.fn3.SS 


growth hormone receptor 


5.5 


433043 W57554 Hs.125019 


SS 


lymphoid nuclear protein (LAF-4) mRNA 


5.5 


439509 AF086332 Hs.58314 


.SS.TM.Syntaxin 


ESTs 


5.4 


425247 NMJX)5940Hs.155324 


SS,PepHdaseJW10,hemopexin,SS 


matrix metalioproteinase 11 {MMP11; stro 


5.4 


409757 NM.001898Hs.1231 14 


.SS.cystatin, 


cystalin SN 


5.4 


425292 NWL005824HS.155545 


SS 


37 kDa leucme-rich repeat (LRR) protein 


5.4 


448045 AJ297436 Hs.20166 


.SS.TM 


prostate stem cell antigen 


5.4 


4526B1 AF153330 Hs.30246 


.SS.TM 


solute carrier famSy 19 (thiamine trans 


5.3 


452243 AL355715 Hs.23555 


SS 


programmed cell death 9 


5.3 


439310 AF086120 Hs.102793 


.SS,TM,UDPGT,caseirukappa 


ESTs 


5.2 
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441111 

452355 
427711 
418636 
429353 
441690 
430447 
429698 
425325 

423600 
414737 
403593 
407758 
445234 
411165 
420633 
414117 
416783 
401093 
411096 
457411 
436007 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 
446089 
445413 
424420 
432378 
452190 
434674 
419986 
421582 
410361 
426327 
406639 
452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418867 
443514 
447499 



AI806867 

N54926 

M31659 

AW749855 

AL1 17406 

R81733 

W17064 

A1685086 

X52730 



Hs.126594 

Hs.29202 

Hs.180408 

Hs.200102 

Ks.33106 

Hs.332848 

Hs.26339 

Hs.1892 



,SS,TM.Phosphodlest 

TM.7tnu1.TM 

SS 

.SSJNtfiECT 

,SS p TMABC_tran,ABC_membrane, 

,SS.HECT,zHJBR1,PABP,14^3-3, 

SS 

,SS,ras, 



phenytethanolamina N-mathyttransferase 



SS,NNMT_PNMTJTEMr t SS^NMTJ 5 NMT_TEMT ( STAR 



ESTs 

G proteirvcoupied receptor 34 
solute earner family 25 (mitochondrial 
gbflV4-BT0534-281299453<Q5 BT0534 Homo 
ATP-btoding cassette transporter MRP8 
ESTs 

SWVSNF related, matrix associated, acti 
ESTs, Weakly similar to S21348 probable 



Hs.310359 
Hs.125087 



AI633559 
A1160386 
HA 

D50915 Hs.38365 
AW137636 Hs.146059 
NH.000169HS.69089 
NM_014581Hs^74480 
W88559 Hs.1787 
AA206186 Hs.79889 

U80O34 Hs.68583 
AW085961 Hs.130093 
AI247716 Hs.232168 
NWL004460HS.418 
AA641836 Hs.30085 
AI215069 Hs.89113 
AF012023 Hs.173274 
M73700 Hs.105938 
NA 

AW057736 Hs.323910 
AF070526 Hs.13429 
M93221 Hs.75182 
AA526235 Hs.193162 
BE093589 Hs.38178 
AA447453 Hs.27860 
AW016669 Hs.29190 
AI668605 Hs.60380 
AA687376 Hs.269533 
AA339449 Hs.82285 
AI360021 Hs.270651 
AA151342 Hs.12677 
BE614743 Hs.146688 
AM93046 Hs.146133 
H26735 Hs.91668 
AA831879 Hs.136985 
A1345455 Hs.78915 
AI910275 Hs.1406 
BE391804 Hs.62661 
W03242 Hs.44898 
M97711 

AI638627 Hs.105685 
AA179949 Hs.175563 
AA863360 Hs.26040 
AA976718 Hs.202242 
AA312082 Hs.105445 
AW294092 Hs.21594 
BE466639 Hs.61779 
AKD00136 Hs.10760 
H87648 Hs.33922 
NM_000246Hs.3076 
AI472078 Hs.303662 
AW935490 Hs.14658 
D45371 Hs.80485 
AW296927 
031771 Hs.89404 
BE464288 Ks.141937 
AW262580 Hs.147674 



SS 
SS 

,CIDE-N,pkinase 
SS.SS 
,SS,TM 



5.1 
ESTs 
ESTS 

Target Exon 
KlAA0125gene product 
ESTs 



SS l MeIibiase,BTKPH lP kin3se 1 SH2 ) SH3 l Rjbo gafedosidase, alpha 
SS,fipocaiin,SS,ffoocaIin odorant-bindlng protein 2A 

,TM,lon_trans,K_tetra, proteoRpid protein 1 (Pelizaeus-Merzbac 

SS.TM.TM monocyte to macrophage dfTerenttaGon-a 

TM.LRRCT.TM.IRRCT, C12000586^gl|6330167|dbj|BAA86477.1| (A 

Peptidase_M3. rrutochondrlal intermediate peptidase 

SS ESTs 
.SS.adfuinc, ESTs 
SS,DPPiV_N - term.Peptidase - S9,SS,DPPlV - NJbrobiast activation protein, alpha 
,SS,trypsin hypothetical protein RJ231 86 

SS ESTs 

,SS, 1 4-3-3 Integrin cytoplasmic domain-associated p 

SS,transferrin,7tm_1 .transferrin, tectotransferrin 
SS,TM,Folate_carrier 1 SS,TM,Folate_canrie NMJ>25243*:Homo sapiens solute carrier f 
,SS,TM,pktoase,RecepJ^domain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2. 



jSS.Ca.channeLB. 
SS,TM,lectirL_c,fn2 t RicbuB_iec«n ( SS,TM 
SS 
SS 

,SS,TM,7tm_1, 
,SS,TM,CBS.voltage^CLC 
JM^IycoJiydroJ 
,SS,pkinase,RhoGEF,ig.PH l SH3, 



Homo sapiens done 24787 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cDNA FU11983 Ms, done HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA OKFZp586M0723 (f 
ESTs 

ESTs, Moderately similar to ALU6_HUMAN A 
ESTs 



AIRS.forrnyLtransf.GARS.SS.GARS.AlRS.for phosphoribosyiglycinamide formyttransfer 



.pkinase 
SS,UPFQ099,SS,UPFQQ99. 
,SS,TM,MAPEG. 
,SS.TM,UDPGT 

,SS,TM,PH,SH2,Furin-like.pkinase,RecepJ. 

,SS,HisLdeacetyl, 

pkinase.OPR. 

SS,trefoa,SSJM,ld{jecept_a.SRCR,tryps 

SS.TM.GBPJM.GBP 

SS 

ss,ss,b, 

> SS l DEAD,Fork_head 
SS 

.SSJM.P450, 

.ig.Sema, 

SS 

,SS,ras ( Yj)ho5phatase,ras 

,SS,HMGJ)OX.fi!ament, 

SS.LRR.SS 

SS 

SS.LRR, 

.SSArfGap, 

,SS,BIR 

SS.CIq.Coflagen.SS.CIq, 
.SS.THPeptkfcseJ/H, 
SS.homeobox.no fneobox, 
,SS,TM,MJP, 

,SS,TM,cadherin,cadherin 
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ESTs, Moderately similar to A47582 B^ceJ 
CGM47protBin 
prostaglandin E synthase 
ESTs 

Homo sapiens done PP1498 unknown mRNA 
ESTs 

GA-binding protein transcription factor, 

trefoil factor 1(pS2) 

guanylate binding protein 1, interferon- 

Homo sapiens done TCCCTA00151 mRNA sequ 

gbflumanT-ceU receptor (V beta 18.1, J 

KIAA1688 protein 

Homo sapiens mRNA; cDNA OKF2p564N0763 (f 
ESTs, Weakly similar to fatty acid omega 
ESTs 

GDNF family receptor alpha 1 
hypothetical protein MGC15754 
Homo sapiens cONA FU13591 fis, done PL 
asporin (LRR class 1) 

Homo sapiens, done MGC.-9084, mRNA. comp 

MHC class li transariivator 

ESTs 

Human chromosome 5q13.1 done 5G8 mRNA 
adipose most abundant gene transcript 1 
gb:UW«W0**>©fl7-Wl.s1 NCJ_CGAP_Su 
msh (Orosophaa) homeo box homotog 2 
ESTs 

protocadherin beta 16 



5.2 
52 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 



5.1 
5.1 
5.1 
5.0 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
4.8 
4.8 
4.8 
4.7 
4.7 
4.7 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
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441560 
409064 
422667 
454032 
432663 
401747 
432882 
437036 
447754 
443194 
451871 
457211 
421566 
431657 
427899 
444779 
442295 
436396 
446039 
422938 
406922 
439285 
424800 
429922 
447178 
409038 
452747 
420139 
408877 
403329 
439926 
430832 
432481 
452410 
418661 
431958 
425071 
447197 
428722 
428330 
423242 
449048 
414831 
413589 
408875 
418629 
450787 
414870 
450325 
407633 
426172 
442262 
427961 
445563 
403943 
408761 
423279 
420440 
445107 
428303 
411667 
427809 
418203 
430376 
444190 
433495 
429638 



F13386 Hs.7888 
AA062954 Hs.141883 



H25642 
W31790 
AI984317 



Hs.133471 
Hs.194293 
Hs.122589 



NM_Q13257Hs,279696 
AI571514 Hs.133022 
AW073310 Hs.163533 
AI954968 Hs.279009 
AI821005 Hs.1 18599 
AW972565 Hs.32399 
NM.000399Hs.1395 
AI345227 Hs. 105448 
AA829286 Hs.332053 
AI192105 Hs.147170 
AI827248 Hs.224398 
AI683487 Hs.152213 
AI150491 Hs.90756 
NMJX)1809Hs.1594 
S70284 

AL133916 Hs.172572 
AL035588 Hs.153203 
Z97630 Hs.226117 
AW594641 Hs.192417 
T97490 Hi50002 
BE153855.HS.61460 
NM.005357HS.95351 
AA479033 Hs.130315 
NA 

AW014875 Hs.137007 
A1073913 Hs.100686 
AW451645 Hs.151504 
AL133619 Hs.29383 
NM_001949Hs.1189 
X63629 Hs.2877 
NM_013989Hs.154424 
R36075 



U76456 
L22524 



Hs.190787 
Hs.2256 



AL039402 Hs.125783 



Z45051 
M31158 



Hs.22920 
Hs.77439 



AW452631 Hs.313803 
NM_015434Hs,48604 
BE247550 Hs.86859 
AB006190 Hs.25475 
N72264 Hs.300670 
AI935962 Hs.26289 
NM_007069Hs^7189 
AA371307 Hs.125056 
BE170651 Hs.8700 
AW293165 Hs.143134 
AW873606 Hs.149006 

AA057264 Hs.238936 
AW959861 Hs^90943 
NM_002407Hs.97644 
AI208121 Hs.147313 
AW974476 Hs.183601 
BE160198 

M26380 Hs.180878 
X54942 Hs.83758 
AW292053 Hs.12532 
AI878918 Hs.10526 
AW373784 Hs.71 
A1916662 Hs.211577 



.pkinase, 

,SS,CUB, 

.SSJM.FMCMike 

.SS.TM 

TM 

,SS,filament,filament 
pkinase,pkinase_C, 
,SS,TM 
.pkinase, 
,SS ( TM 
.SS.GDNF. 
WH1.WH1 
zf-C2H2,SS 
,SS,TM.pkinase 



,SS,SAA^proteins r ABC_m8mbran8 t ABC_tran t 38rum amyloid A1 
SS ESTs 



Homo sapiens done 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakly similar to 154374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
serum/glucocorticoid regulated kinase-li 
ESTs 

Homo sapiens cONA RJ14142 fis, clone MA 

matrix Gia protein 

ESTs 

ESTs, Weakly similar to S51797 vasodilat 
early growth response 2 (Krox-20 (Drosop 
ESTs, Weakly similar to B34087 hypothec 



.COLFI.vwc.Collagen, 
,wnt 

,TM,GlycoJiydroJ 

,SS,TM,thJo)ase, 

SS.TM.Desaiurase.SS 

,SS,lg,pklnase,LRRNT,LRRCT ) 

HLH.SS 

l SS,TM l 0nker_histone,7tm_1 

.SS.TM 

SS,IL8,SS,1L8 

,SS,HLH 

,SS,TM,p450, 

,SS,TM 

ss.ss 

SS 
SS 

,SS t Coliagen t COLFl f TSPN ( 
.SS.TM.ras 
SS 



Homo sapiens cDNA FU11469 fis, clone HE 
wingless-type MMTV integration site faml 
ESTs 

centromere protein A (17kD) 
gb:stearoyi-CoA desaturase [human, adlpo 
hypothetical protein FU20093 
MyoD family inhibitor 
Hlhlstonefamiry, member 0 
ESTs 

small Inducible cytokine subfamily A (Cy 

Ig superfamSy receptor LNIR 

lipase, hormone-sensitive 

ESTs, WeaMy similar to A47582 B-cell gr 

Target Exon 

ESTs 

ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FU1 1973 fis, clone HE 
Homo sapiens mRNA; cDNA OKFZp434E2321 (f 
E2F transcription factor 3 



SS,TM,Cadherin^C - terr^(^herin l SS l TM l cadcadherln 3, type 1, P-cadherin (placenta 
SS.T4 delodinase J4_deiodinase, detodinase, iodothyronine, type II 

JM.SOF, gb:yh88b01.s1 Scares placenta Nb2HP Homo 

.SS.TIMP, tissue Inhibitor of metalioproteinase 4 

SS,PeptidaseJ^10,SS,Peptkjase_M10 l hemopematrlx metaUoproteinase 7 (MMP7; uterin 
SS DEME-6 protein 

SS,SS # TM similar to S68401 (cattle) glucose indue 

,SS,cNMP_binding,RiIa,HMG_box protein kinase, cAMP-dependent. regulato 

.SS.abhydrolase ESTs, Highly similar to AF157833 1 nond 

SS OKFZP434B168 protein 

SS.SH2,PH,SS,TM.PH.SH2,Furin^ike l pklnas growth factor receptor-bound protein 7 



SS.TM.MIP.SSJM.MIP. 



TMJM 
.SS.DENN 
.SS.START, 
SS 

,SS,WH1,WH1 
p450.SS.p450 
,SS,TM,7tmJ, 
SS 

,SS.SRCR,Uteroglobin 
f SS,TM 

SS,RGS.RGS.RGS 
TM 

,SS.Upase.PLAT,Sec7J>H ( 

CKS.SS.CKS, 

SS 

SS 

SS.Ig.MHCJ.connexln.SCAN.SS.TM 
SSJM.SS 



aquaporln 7 
K1AA1204 protein 
ESTs 

similar to rat HREV107 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C500O355:gl|45O3225IreflNP^000765.1| cyt 
ESTs, Weakly similar to (defGne not ava 
ESTs 

rnammaglob)n2 (MGB2; mammaglobin B; lip 
ESTs, WeaMy similar to 138022 hypotheti 
regulator of G-protein signalling 16 
gb:QV1-HTM13410200-05WK)3 HT0413 Homo 
lipoprotein lipase 
COC28 protein kinase 2 
chromosome 1 open reading frame 21 
cysteine and glyclne-rich protein 2 
alpha-2-glycoprotein 1, zinc 
kinecfln 1 (kinesin receptor) 



4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
A2 
4.1 
4.1 
4.1 
4.1 
4.1 
4,1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
3.9 
3.9 
3.9 
3.9 
3.9 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3,8 
3.8 
3.8 
3.8 
3.8 
•3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
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454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407999 
425548 
446619 
411213 
406625 
417511 
428769 
407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 



414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 
406925 
445873 
418054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
416334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



A1041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs. 191 533 
AW194426 Hs.20726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM_014735Hs.82292 
AH26271 Hs.49433 
AA890023 Hs.1906 
AU076643 Hs.313 
AA676939 Hs.69285 
Y13647 Hs.119597 
AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NWL000909HS.169266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM_014400Hs.11950 
A1791493 Hs.129873 
AI308876 Hs.103849 
AF245505 Hs.72157 
AI41 7B28 Hs.192435 
AA847843 Hs.62711 
S57298 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NM_002543Hs.77729 
AA116021 Hs.38260 
AI651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NH.014918HS.110488 
AKD01423 Hs.94694 
U25128 Hs.159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM_002318Hs.83354 
8E390551 Hs.77628 
NM_003512Hs.28777 
NM.000346Hs.2316 
AA442324 Hs.795 
M25809 Ks.64173 

NMJ)04585Hs,17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NMJ05419HS.72988 
AW968504 Hs.123073 

BE274552 Hs.76578 
AW797437 Hs.69771 
H73505 Hs.117874 
086961 Hs. 79299 
NM.005456HS.268215 



,TM,7tmJ, 

.SS.TM.EGF.WLrecepLa.ldLfeoepLb.EGF 
.SS.AAA, 

,$S,GlycosJransf_2, 
SS 

Glycos_trans(J.RiciruBj8Cfln f SS,Glyco3 
lipocaIin,SSJpocar»i,(lpocaIjn ( fefriti 
,SS,Pribosyitran,Sullatase 
PHD.pklnase.SS 
SS 

SSJM.fn3,SS,TM,tn3, 
.SS,TM,efhand.ion_trans 

SS.TM,CUB,F 5J : 8_type_C.MAM t SS,TM,CUB < F5jieurop8in 1 



ESTs 

ESTs, Weakly similar to ALU1 JiUMAN ALU S 
ESTs 
ESTs 
ESTs 

UDP4tecetytetpha^alactQsamine:polyp 
fatty add binding protein 4, adipocyte 
DMA replication factor 
WAA0215 gene product 

ESTs, Weakly similar to YZ28 JJUMAN HYPOT 
prolactin receptor 

secreted phosphoproteln 1 (osteopontln, 



stearo^CoA desaturase (delta-9-desatur 
chord irvllke 
ESTs, 

gbye53h05.s1 Soares fetal liver spleen 
Target Exon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs. Weakly similar to S64329 probable 
ESTs 

small inducible cytokine subfamBy A (Cy 
GPI-anchored metastasis-associated prate 
ESTs, Weakly similar to A36036 cytochrom 
JHhemopaxinPeptidase^MIO.hernopexin.Peitypou^ protein DKFZp761D112 
ig,LRRCT, DKFZP564I1922 protein 

,SS,TM ESTs 

,SS.HMG_box, Homo sapiens, clone IMAGE:3351295, mRMA 

,SS,TM l SH2 1 PH,pWnase^e(»p^domainfurv^rb-b2 avian erythroblastic leukemia v 



SS.TM.Desaturase.SS 
SS 

,SS,7tm 1.SPRY, 

,SS,TM,GDA1_CD39 

.SS.filament, 

SS.homeobox.Ets.SS.homeobox, 
SS 

SS.7M,7tmJ, 
SS 

.SS.TM.PMf^Claudin, 
.SS,TM,IL8 
,SS,PH,!actamase_B 
,SS,p450,p450 



HER2 receptor tyrosine kinase (c-erb-c-2, 
serum-inducible kinase 
hypothetical protein FU20093 
EST 

oxidised low density lipoprotein (lectin 
ubiquitin specific protease 18 
ESTs 
cycSnBI 

ESTs, Weakly similar to ALU5 JiU MAN ALU S 
WAA0990 protein 

Homo sapiens cONA FU 10561 fis, done NT 
parathyroid hormone receptor 2 



SS 

pkinase^OlOJ>ox,SS ( pktoe,POlOJbox, 
,SS 1 ig,pWnase,LRRNT l LRRCT, 
,SS,TM,KRAB,SCAN,zf-C2H2.ig 
.SS.TM 

SS.UCH-1,UCH-2.SS.TM.G_glu - transpept 
SS 

cydln f SS f TM,cydln, 
SS 
SS 
SS 

SS.TM.7tnU,SS.TM t 7tmJ2 

JM I ABC_rrKrnbrarw^C - tran,RibosomaLS4eCbp/p300Hntera transactivator, wit 
l SS,Tl^transportj)rotSWlB,RhoGAP,OAGJ ) Eglyceiol^ 1 (so 

,SS I rrm,PABP,pkinase,14-3-3 l rrm poly{A)^inding protein, cytoplasmic 1-1 

,SS,TM.mfto_carr,Lysyl_oxidase lysyl oxidase-fike 2 

SS,START t SS,START,NNMT„PNMT_TEMT, steroidogenic acute regulatory protein r 
SS^stone,Calc„CGRP„tAPP,^,MHCJ,SPRY^2A histone family, member L 



SS.HMG.box, 
histone,SS,histone,BotA 
ATP-synLab ( SS,7tm_1ATP-synLab 
f!iamentSS,fiiament,filameM 
TM 

,SS,TM,RibosomaLL27e. 

,Kunl*_BPTI, 

SS,p450,SS 

SS 

SSJL8. 

SH2,STAT,SS,STAT 



RhoGEF.PH.SS.RnoGEF.PH 

SAP.SS.FG-GAP.vwa 

SS^shi,trypsln,vwa r mrufibrinogeruC l fn 

.SS.Peptidase.SSP.Peptidase^Sap 

,SS,TM 

,SS,Pribosyttran, 
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SRY (sex determining region Y)-box 9 (ca 
H2A histone family, member O 
ATPase, H transporting, lysosomal (vacuo 
N&L005557 s :Homo sapiens keratin 16 (foca 
retinoic acid receptor res ponder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450, subfamBy JIB (phenobar 
ESTs, WeaWy similar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
CDC2~related protein kinase 7 
NM_003371*:Homo sapiens vav 2 oncogene ( 
protein inhibitor of activated STAT3 
B -factor, properdin 
ESTs 

lipoma HMGfC fusion partner-like 2 



3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
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439452 
452017 
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452106 
447519 
426928 
438825 
414575 
417837 
422128 
445941 
429973 
444542 
459561 
425741 
426501 
456508 
434228 
415752 
400419 
439750 
423858 
428514 
25 428698 
448988 
432072 
417433 
452194 
444051 
420042 
457292 
421458 
431104 
443767 
419589 
415447 
443464 
423431 
413278 
458451 
440449 
413753 
434876 
435575 
415773 
446440 
450847 
426075 
452110 
439963 
402837 
439451 
406664 
417315 
413011 
414987 
429197 
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60 407604 
419092 
456672 
443171 
452256 

65 432201 
406642 
400903 



AA378776 Hs.288649 
AA918317 Hs.57987 
AF109302 Hs.27495 
AK000725 Hs.50579 
AJ141031 Hs.21342 
U46258 Hs.339665 
AFO37062 Hs.172914 
BE327427 Hs.79953 
H11257 Hs.22968 
AL079905 Hs.1103 
AW881145 

AI267371 Hs.172636 
AW23317 Hs.164680 
AI161293 Hs.280380 
Ai547306 Hs.134981 
AF052152 Hs.159412 
AW043782 Hs.293616 
AA502764 Hs. 123469 
242047 Hs.283978 
BE314524 Hs.78776 
AF084545 

AL359053 Hs.57664 . 
AL137326 Hs.133483 
AW236861 Hs.193139 
AA852773 Ks.334838 
Y09763 Hs.22785 
N62937 Hs.269109 
BE270266 Ks.82128 
AI694413 Hs.332649 
N48373 Hs.10247 
AW015140 Hs.161723 
AI921270 Hs.334882 
NKL003654HS.104576 
AW970859 Hs.313503 
BE562136 Hs.9736 
AW973708 Hs.201925 
Z97171 Hs.78454 
BE548446 Hs.5167 
AA326062 
BE563085 Hs.833 
AW297181 Hs.195922 
AA885430 Hs.201925 
U17760 Hs.75517 
AF16Q477 Hs.61460 
AF213457 Hs.44234 
R21651 Hs.324725 
AV658411 Hs.42656 
NNL003155HS.25590 
AW513691 Hs.270149 
T47667 Hs.28005 
AW247529 Hs.6793 
NA 

AF086270 Hs.278554 
L34041 Hs.9739 
AI080042 Hs.336901 
AW068115 Hs.821 
AA524394 Hs.294022 
H24471 Hs.26930 
N30714 Hs.325960 
AW191962 Hs.249239 
J05581 Hs.89603 
AK002016 Hs.1 14727 
BE281128 Hs.9030 
AK000933 Hs.28661 
AI538613 Hs.298241 
AJ245210 
NA 



SS,SS 

ss.ss 

ss 

ss 

ss 

ss 

,SS,adhjshortTGF-beta,TGFbj)ropeptlde 
,SS,TM,hIstor»eANFjBceptor,guanyiate_cy 
.SS.pkinase.lg, 

SSJGFbj>ropeptide,TGF-beia,SS 
SS 

SS.SS.Iipoxygenase.PLAT 
,SS,T-box,UDPGT 

SS.SS.Pepttdase^MI.EGF^ecfiiLc.susW 
SS 

.pklnase, 

SS 

SS 

.SS.TM.7tmJ 
TM 

,SS,PepfkJase_M1, 

JM.integriruB.RicliUBJectln.nm 

,SS,TM 

,SS I START,NNMTJ 3 NMT_TEMT, 
SS 

SS.TM.TM 
t Seffia,ig, 

'SS,TM I LRRCT,LRRNT,LRR,TM,LRRCT, 

,SS,TM.7tnL3 l ANF.receptor l sushi 

.SS.ig, 

.SS.CUB. 

SS,TM,SS,TM,G-patch 
SS 

.Semajg, 

,SS,PCI,RasGEF,hormonejec.zf-C4, 
.FGF. 

SS,OLF,OLF,OLF,RlbosomaLt4 

SS,TM,SSF,SS,TM 

,SS,p450,p450 

.SS.mubjquitin.lamlnlJLG.lamlninJEGF.k 
,SS,RibosomaUL14 
.FGF, 



hypothetical protein MGC3077 

B-cell CLL/lymphoma 11B (zinc Gnger pro 

prostate cancer associated protein 7 

hypothetical protein FU20718 

ESTs 

ESTs 

retinol dehydrogenase 5 (11-cis and 9-d 
ESTs 

Homo sapiens done 1MAGE.451939, mRNA se 
transforming growth factor, beta 1 
gb:QV0-OT00334)1040O-182-a07 OT0033 Homo 
ESTs 
ESTs 

amfoopeptidase 
ESTs 

Homo sapiens clone 24628 mRNA sequence 
ESTs 

ESTs, Weakly similar to AF208855 1 BM-01 
Homo sapiens PR02751 mRNA, complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length insert cDN 
Homo sapiens mRNA; cDNA DKFZp434B0650 (f 
ESTs 

KIAA1866 protein 

gamma-aminobutyric acid (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblast glycoprotein 
olfactory receptor, family 2, subfamily 
activated leucocyte cell adhesion motecu 
ESTs 

hypothetical protein FU14251 
carbohydrate (keratan sulfate Ga(-6) sul 
ESTs 

proteasome (prosome, macropain) 26S subu 
Homo sapiens cDNA FU13446 fis. clone PL 
myocilin, trabecular meshwork inducible 
Homo sapiens mRNA; cDNA DKFZp434F152 (fr 
gb£ST29171 CerebeBum II Homo sapiens c 
Interferon-stimulated protein, 15 kDa 
ESTs 

Homo sapiens cDNA FU13446 fis, clone PL 



SS,lamlnin^GFMlnin_Nterm^dh - .short,S lamlnin, beta 3 (nlcein (125kD), kalinln 



,SS,HLH 
SS,lg,SS,TM 

,SS,TM l RibosomaLS3Ae,G-gamma 
SS 

.SS.homeobox, 
,SSM 

,SS,TM,A«n_recp,pkinase 

,TM,p450,Ets 

SS 

,SS,Chromo_shadow,chromo ( 



Ig superfamtly receptor LNIR 
triggering receptor expressed on myeloid 
ESTs, Moderately similar to A47582 B-cel 
KIAA1681 protein 
stanniocaldn 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FU11309 fis, done PL 
platelet-acfivating factor acetylhydrola 
ENSPW000241312*:DJ947L8.1.8 (novel Sush 
heterochromatin-llke protein 1 



,SS,TM.tfarep^rU)rot$WIB,RrwGAP^^ 1 (so 

,SS,RH)LpoLA.RNAjJoU\2,Ribo3omaLS24e l r^ protein S24 



SS,LRR,LRRNT,SS,LRRNT,LRR, 
,SS,connexln,horrTrone_rec^^,connexin 
.SS.Gelsolin, 
.SSJM 
,SS,TM.C1q, 
SSJM.SEA, 
,SSWK 
SS,WimJ,rrm,$S 
,TM.GDI,7tmJ. 

SS,TM,trypsin,SS,TM,trefoll,trypsln,tref 

SS 

SS 
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bigiycan 
hypothetical protein FU 14950 
ESTs. Weakly similar to T20272 hypotheti 
membrane-spanning 4 -domains, subfamily A 
collagen, type VJIJ, alpha 2 
mucin 1, transmembrane 
Homo sapiens, done MGC: 16327, mRNA, com 
TONDU 

Homo sapiens cDNA FU10071 fis, done HE 
Transmembrane protease, serine 3 
gbiHomo sapiens mRNA for Immunoglobulin 
Target Exon 
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434408 
452994 
445903 
424364 
410196 
419150 
433417 
418624 
436291 
411000 
412519 
450223 
422790 
424269 
435854 
447388 
451631 
448105 
438637 
423024 
456592 
425920 
444670 
401197 
437755 
452560 
410274 



AI031771 

AW962597 

AJ347487 

AW383226 

Ai936442 

T29618 

AA587773 

AI734080 

BE568452 

N40449 

M196241 

AA418204 

AA809875 

AW137691 

AJ278120 

AW630534 

R00866 

AW591433 

BE500941 

AA593731 

R91600 

AL049977 

H58373 



Hs.132586 

Hs.31305 

Hs.132781 

Hs^01189 

Hs.59838 

Hs.89640 

Hs.8859 

Hs.104211 

Hs.5101 

Hs.201619 

Hs.73980 

Hs.241493 

Hs.25933 

Hs.199754 

Hs.4996 

Hs.76277 

Hs.298241 
Hs.126730 
Hs.325823 

Hs.162209 
Hs,332938 



AW204256 Hs.291887 
BE077084 Hs.336432 
AA381807 Hs.61762 
W27249 Hs.8109 



404826 
458389 
408196 
433675 
418848 
422095 
415992 
424631 
409956 
419667 
450946 
447770 
423826 
418838 
442804 
432284 



H70284 

AL034548 

AW977653 

A1820961 

AI868872 

C05837 

AA688021 

AW103364 

AU077005 

AA374569 

AB032417 

U20325 

AW385224 

AW300118 

AA532807 



Hs.160152 

Hs.43627 

Hs.75319 

Hs.193465 

Hs.282804 

Hs.145807 

Hs.179808 

Hs.727 

Hs.92208 

Hs.127698 

Hs.19545 

Hs.1707 

Hs.35198 

Hs.131257 

Hs.105822 



,SS,GlycoJiydro_2 ESTs 
SS,WD40,SS,WD40, WAA1547 protein 

fn3,SS JM.EF1BD class I cytokine receptor 

SS ESTs, Weakly similar to G0 1763 atrophin- 

UBACT^repeaLSS.UBACT_repeatThiF_femilyhypoe)etIcal protein aJ10808 
.TM.pkinase.fo3, TEK tyrosine kinase, endothelial (venous 

.SS.SRCR, Homo sapiens, Similar to RIKEN cONA 5830 

.Sernajg. ESTs 
.SS.abhytirolase, protein regulator of cytokinesis 1 

SS ESTs, Weakly similar to S38383 SEB4B pro 

SS,Trop(mm^erraggk]tinln l SS I TM,C2,Tropo troponin T1, skeletal, stow 



.SS.proJsomerase, 
,TM,histone,Sec1 .histone.sugarjr 
,SS.TM,7tm_2,GPS 
,SS,WD40 

,SS,TM,rrm.oxidoredji6,<»idoredji6 
SS 

.SS.TM.trefoil.trypsln.trefoil 
,TM,PH, 
.SSJM.CD36 



natural klBer-tumor recognition sequenc 
ESTs 
ESTs 

putative ankyrirwepeat containing prote 
Homo sapiens, done MGC:9381, mRNA, comp 
gb:ye79c02.s1 Soares fetal Over spleen 
Transmembrane protease, serine 3 
ESTs, Weakly similar to WAA1214 protein 
ESTs. Moderately similar to ALU5.HUMAN A 



,SS,RaruBP1,UM.Ran.BP1,GRIP l TPR l UM gb:yq10c02.r1 Soares fetal Dverspteen 
SS,TM,SS,T^PMf^Claudln.PMP22_Claudinclaudin 8 
.SS.TM hypottietical protein MGC5370 

arfM ENSP00000229263*:HSPC213, 
,wnt, ESTs 
f SS.rrm,2f-RanBP t pkinase < C2,pkinase_C,DA ESTs 
SS.SS hypoxa-inducibte protein 2 

SS hypothetical protein RJ21080 

.SS.TM Target Exon 

.SS.RA ESTs. Weakly similar to FPHU alpha-fetop 

HMG_box t pWnase^CHC l SS.TM^MG_box,SRY (sex determining region Y)-box 22 
.SS.ribonucjecLsm, ribonucleotide reductase M2 polypeptide 

,death,ZU5,pkinase^ctivinjecp l ESTs 
SS.Cu^xWase.SS.Cu-oxidase.Cu-oxidase hypothetical protein FU22704 
.SS.TM hypothetical protein FU13593 

SS ESTs 
SS.TGF-beta.TGFbjMopeptide.SS.TGF-beta.inhibln. beta A (actMn A, activtn AB a 
SS,disintegrfo I Repralysin t PepJA12Bj)rope a disintegrin and metalloprotelnase doma 
SS * ESTs, Moderately similar to 2109260A B c 

Frizzled.Fz.SS.TM^rlzzled.Fz. frizzled {Drasophila) homotog 4 

SS.SS cocaine- and amphetarnine-feguJated trans 

,SS JWPhosphoaTest, ectonucieotide pyrophosphatase/phosphodi 

,SSJM,G-garnma ESTs 
.SS.T^pkinase, ESTs 
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TABLE 20A 



Table 20A shows the accession numbers for those pkeys lacking unigeneED's for Table 20. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California), The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



CAT number: Gene cluster number 
Accession: Genbank accession numbers 

15 



Pkey 


CAT number 


Accessions 


410785 


1221055.1 


AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 


411667 


1253334J 


BE160198 AW935898 T11520 AW935930 AW856073 AW861034 


418636 


1774Q2J 


AW749855 AA225995 AW750208 AW750206 


420854 


197072J 


AW296927AI684514AI263168AA281079 


422128 


211994.1 


AW881 145 AA490718 M85637 AA304575 T06067 AA331991 


423431 


228162J 


AA326062 AA325758 AW962182 


423945 


233566.1 


AA410943 AW948953 AA334202 AA332882 


426878 


273265J 


BE069341 AW74M03 AL044891 AI908240 AA393080 


433687 


373061.1 


AA743991 AA604852 AW272737 


447197 


711623J 


R36075AI366546 R36167 


451631 


878098J 


R00866R01523A1806815 


456207 


165078.-1 


AA193450 


456592 


202684.1 


R91600T87079AA291455 



Pkey: 



Unique Eos probeset identifier number 
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TABLE 20B 
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Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. Be 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The ONA sequence of human chromosome 22/ Dunham I et aL, Nature (1999) 402:489-495. 
Strand: indicates DNA strand from which exons were predicted. 

NLpositlon: indicates nudeoflde positions of predicted exons. 

Pkey Ref Strand Nt_position 



400608 9887666 

400903 2911732 

401045 8117619 

401093 8516137 

401197 9719705 

401747 9789672 



401780 
401781 
401866 
402230 
402408 
402578 
402606 
402837 
403199 
403329 
403593 
403943 
404091 
404347 
404826 
405366 
405654 



7249190 
7249190 
8018106 
9966312 
9796239 
9884928 
9909429 
9369121 



8516120 
6862650 
7711864 
7684554 
9838195 
6572184 
2182280 
4895155 



Minus 

Plus 

Plus 

Minus 

Plus 

Minus 

Minus 

Minus 

Plus 

Minus 

Minus 

Pius 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Minus 

Plus 

Plus 

Plus 

Minus 



96756-97558 
59112-59228 

90044-90184,91111-91345 

22335-23166 

176341-176452 

118596-118816,119119-119244,119609-119761J2M22-1^^ 
131932,132451-132575.133580-134011 

28397-28617^92O-29(M5^135-29296^11-29567 ( 29705-29787,30224-30573 

83215^35,83531^3656,83740^01,84237-84393,84955^37,86290-86814 

73126-73623 

29782-29932 

110326-110491 



81747-82094 

2013-2186.9570-9758,1 1 136-11309,19429-19677,21210-21455.23368-23562^4342-24527.29132-29320 

58895^59036,66618-66789 

96450-96598 

62554-62712,69449-69602 

100742-100904,101322-101503 

82121-83229 

74493-74829 

47726-48046 

22478-22632 

53624-53759 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 
AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 
molecules. These were selected as for Table 19, except that the ratio was greater than or equal 
to 3.0, the "average" normal adult tissue level was set to the 85 th percentile value amongst 
144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 
peptidase, phosphatase, ATPase, or ionjransporter domains). The predicted protein domains 
are noted. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UntgeneiD: Unigene number 

Unigene Titia* Unigene gene title 

R1 : Ratio of 93rd percentile tumor to 85* percentile of normal body tissue 



Pkey ExAccn UnigenelD Predicted Protein Domains UnigeneTttle R1 

449746 AJ660594 Hs.176588 SS,p450 ESTs, Weakly similar to CP4YJWMAN CYTOC 65.7 

400292 AA250737 HsJ2472 death7U5,TM^ctivin_recp,pkinase l 8MP-R1B 55.9 

424735 U31875 Hs.272499 SS.TM sharfctain alcohol dehydrogenase family 53.8 

407178 AA195651 Hs.104106 SS.Dihydroorotase, ESTs 39.3 

408045 AW138959 Hs.245123 PhosphodlestSomatomedin_B, ESTs 34.9 

450375 AA009647 Hs.8850 SS.THdismtegrk»Pep.M12Bj)ropep l Reprol a dlslntegnn ard metatloproteiiiase dorna 257 

429170 NMJX)1394Hs.2359 OSPcRhodanese, dual specificity phosphatase 4 245 

445730 AI624342 Hs.170042 SS.TM,CatkXLeffiux ESTs 24.1 

424634 NM_003613Hs.151407 Jg.tspJ.SS-AAA cartilage Memiediate layer protein, nu 21.7 

420757 X78592 Hs.99915 hormone^rec^drogen^recep,2f-C4, androgen receptor (dihydrotestosterone r 21.7 

424399 AI905687 Hs.2533 SS aldehyde dehydrogenase 9 family, member 20.3 

447350 A1375572 Hs.172634 pkinase. ESTs 19.2 

456207 AA193450 SS.TM.p450.p450 gb3r40e07j1 Soares_NhHMPu_S1 Homo sapi 18.3 

456938 X52509 Hs.161640 SS,TM,anrfr»tranJ_2,C^herinJLterm^ W-1 

402578 SS.pWO.SS.TM^SQ C1001134:gi|2117372|pir|ii65981 fatty ac 17.8 

425692 D90041 Hs.155956 SS-Acetyltransf2 t N-acetyitransferase 1 (arylamine N-acety 167 

424001 W67863 Hs.137476 pkinase, paternally expressed 10 16.5 

418007 M13509 Hs.83169 SS,r«mopexin < PeptidaseJMQ,SS > Peptklasejra metaltoprotetnase 1 (MMP1; Inters 157 

421727 Y13153 Hs.107318 kynurenine3-monooxygenase(kynurenine3 15.3 

411869 W20027 Hs.23439 SS.PeptWaseJ/11, ESTs 13.9 

400289 X07820 Hs.2258 hemopexin f PeptkJaseJ/l10,SS,Pepte^^ 13.5 

443348 AW873596 Hs.182278 SS.DENN calmodulin 2 (phosphorylase kinase, delt 13.0 

424086 AJ351010 Hs.102267 SS.Lysyloxidase lysy) oxidase 12-8 

400295 W72838 Hs.2533 SS aldehyde dehydrogenase 9 family, member 127 

408771 AW732573 Hs.47584 mK-tetia.lonJraiK.SS.TM.K.tetra.ion t potassium voltage^ated channel, delayed 11.9 

421155 H87879 Hs.102267 SS,LysyLoxWase,Aldose_epim,Epimerase,S lysyl oxidase 11.8 

424905 NMJ02497HS.153704 pklnase.SS.TM.pkinase.polyprenyLsynt, NIMA (never In mitosis gene ablated k 117 

438167 R28363 Hs.24286 SS,TM,7tmJ I p450,rrm ESTs 115 

459583 AI907673 pkinase, gb:IL-BT152-0803994)04BT152Homosapien 11.5 

423945 AA410943 deafo,ZU5,TM,A<tfvinjecp,pkinase t gb:rt32h03.f1 Soares ovary tumor NbHOTH 11.4 

445263 H57646 Hs.42586 SS,Acy]transferase, KIAA1560 protein 11.2 
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423887 AL080207 Hs.134585 SS,TM,BRCT l anMBC_tran^BC_tran DKFZP434G232 protein 10.9 

429859 NM.007050HS.225952 SS,TM,Yj)tospretose I MAM.fh3, protein tyrosine phosphatase, receptor t 10.4 

428368 BE440042 Hs.83326 SSPeptkJase M10 ( rtenTopexin,SSPeptWase^matrix metalloprotelnase 3 (strometysin 10.3 

418912 NMJW0685HS.89472 SS.WmJ.SS.TNtftmJ, angiotensin receptor 1 10.3 

5 451952 AL120173 Hs.301663 SSjJdnase, ESTs 10.3 

402408 NA • SS.carb.anhydrase NM_030920*:Homo sapiens hypothetical pro 9.8 

415539 AI733881 HsJ2472 death^SWdvirLrecp^kinase, BMP-R1B 9.4 

406687 M31126 Hs.272620 SSPeptkJase M10,rerropexln.SS,Pepfidasejnatrix metalloprotelnase 11 (MMP11; stro 9.1 

419948 AB041035 Hs.93847 Ferric_reductTM,Ferrlcjeduct MM_016931:Homo sapiens NADPH oxidase 4 ( 9.1 

10 400285 NA TM>\BC.tranABCjnembrane, Eos Control 8.8 

408380 AF123050 Hs.44532 SSJHuWquitin,7tnT_3>\NF - re<^towushl dJuWquitin 8.6 

409203 AA780473 Hs.687 SS,p450.SSj>450 cytochrome P450, subfamily IVB, polypept 8.3 

424902 NM_003866Hs.153687 SS,SS inositol pc^aspriate^hosphatase, ty 8.3 

431725 X65724 Hs.2839 SS.Cys_knot.SS Norrie disease (pseudogEoma) 8.3 

15 418092 R45154 Hs.106604 deathZU5.pklnase^ctMn_recp, ESTs 8.3 

427811 M81057 Hs.180884 SS,Znjxm}OpeptPropepjm,SS l Prorjep^^^ 6-2 

423554 M90516 Hs.1674 GATase_2 4 SIS, glutamlne^dose-6i)hosphatetransarnin 8.1 

426261 AW242243 Hs.168670 SS,TM,WD40,ubiqul^ 7.8 

413374 NH_001034Hs75319 SS ribonucleotide reductase M2 polypeptide 7.6 

20 432677 NM_004482Hs.278611 SS,TM,Glycos_transL2 ( Ricln.B.tectin,DPP UDP-l*^tyl*lpha^aiactosamfoe:poryp 7.6 

456986 D38299 Hs.170917 SS,TM,7tmJ, prostaglandin E receptor 3 (subtype EP3) 7.5 

453331 AI24C665 Hs.8895 SS.TM,dislntegrln.Pep_M12B_propepPeprol ESTs 7.3 

407721 Y12735 Hs.38018 pklnase, dual-specificity tyrosine-(Y}-phosphory1 72 

418004 U37519 Hs.87539 SS.aJdedh,SS ( aktedh, aldehyde dehydrogenase 3 family, member 7.2 

25 410555 U92649 Hs.64311 TM,disintegrin,ReprolYsin, a dislntegrtn and metaDoproteinase doma 7.1 

443695 AW204099 Hs.337720 ESTs, Weakly simflar to AF126780 1 retin 6.9 

423545 AP000692 Hs.129781 GAFPDEase riromosome 21 open reading frame 5 6.8 

401045 ICEj)20,SS l ICEj)10JCEj)20 l !CEj)10 I ICEj) 
C11001883^iI6753278lreflNPJ33938.1|c 6.7 

30 442082 R41823 Hs.7413 TM.EPH_lbd,pkinase,SAM I fn3, ESTs; cafeyntenin-2 6.7 

418836 AI655499 Hs.161712 TM^ctivi(urecp.pWnase l deathZU5 l ESTs 6.6 

404091 NA TMjtmJANFjeceptor, Target Exon 6.6 

450865 AI248013 Hs.1 08532 zf-C2H2 ESTs, Weakly similar to I38588 reverse t 6.5 

424085 NM.002914HS.139226 SS.AAA.VTraLhelicasel.rrm, replication factor C (activator 1)2 (40 6.5 

35 449650 AF055575 Hs.23838 TM.ion.trans.SS.TM.ion^trans, calcium channel, voltage-dependent L ty 6.4 

432304 AA932186 Hs.69297 TM,7tmJ, ESTs 6.2 

415669 NWL005025Hs78569 SS^erpin, serine (or cysteine) proteinase inhibito • 6.2 

442942 AW167087 Hs.131562 SS.ig.Sema.pklnase, ESTs 6.2 

428795 R45503 Hs.97469 SS.TM ESTs, Highly simflar to A39769 N-acetyt! 6.1 

40 428479 Y00272 Hs.184672 SS.pWnase.pkinase ceO division cycle 2, G1 to S and G2 to 6.1 

400300 X03363 SS,TM.pWnase^ecep_Ldorna!n ( SH2 1 PH.FuriHER2 receptor tyrosine kinase (c-erb-b2. 6.1 

447359 NMJ)12093Hs.18268 SS,adenylatekinase f adenylate kinase 5 6.1 

402230 NA SS,TM,p450, Target Exon 6.1 

424687 J05070 Hs.151738 SS.PeptidaseJVI10,fn2,herrc>pem^ 5.8 

45 432328 AI572739 Hs.195471 6PF2K.PGAM, 6i)hosprKjmicto*2-Wnase/fiructose-2 ( 6-bl 5,8 

432239 X81334 Hs.2936 SS,Peptidasej^10,heniopexin,SS t Peptklasejnatru(nie^ 5.6 

400286 NA SS.TM.ABCJran.ABC.membrane.SS C16000922^i|7499103|pirl|T20903hypothe 5.6 

425247 NMJ)05940Hs.155324 SS,Pepfidase_M10.hemopexIn,SS matrix metatoprotetee 11 (MMP11;stro 5.4 

434737 AA828246 Hs.291884 UCH-1 l pWnase,OPR,RrK»danese^-binding l ESTs 5.4 

50 439310 AF086120 Hs.102793 SS.TM,UDPGT,caseinJappa ESTs 5.2 

441111 AI806867 Hs.126594 SS.Whoaphodiest, ESTs 5.2 

452355 N54926 Hs.29202 TM.7tm_1.TM G protein-coupled receptor 34 5.2 

427711 M31659 Hs.180408 SS solute carrier family 25 (rrtochondrial 5.2 

429353 AL117406 Hs.200102 SS.TM^C.tranABC.membrane, ATP-binding cassette transporter MRP8 5.1 

55 425325 X52730 Hs.1892 " SSWMT_PNMT_TEMT,SS^NMT_PNMT_TEMT,STAR 
phenytefiianolambieN-methyitransierase 5.1 

448706 AW291095 Hs.21814 SS.TM/pkinase, interfeukin 20 receptor, alpha 5.1 

403593 NA < CIDE-N.pWnase Target Exon 5.1 

432777 AA564991 Hs.269477 atpha^amylase. ESTs 5.0 

60 446232 AI281848 Hs.194691 SS.WtmJ.RlbosornaU.13 reiinoic acid Induced 3 4.9 

411165 NMJ)00169Hs.69089 SS,Melibiase,BTK,PH,pta^ 4.9 

414117 W88559 Hs.1787 TM J ton_trans,K_tetra, prateoflpfcf protein 1 (Pelizaeus-Merzbac 4.9 

411096 U80034 Hs.68583 Peptidase_M3, mitochondrial intermediate peptidase 4.9 

450506 NM.004460Hs.418 SS,DPPJV^>rrRPept]aase_S9,SS.DPPlV_N^fibroblast activation protein, alpha 4.9 

65 417975 AA641836 Hs.30085 SS.trypsIn hypothetical protein FU231 86 4.9 

447752 M73700 Hs.105938 SS.transferrtnytmJ .transferrin, tactotransferrin 4 - 8 

427122 AW057736 Hs.323910 SS,TM l pkinase^ecep_l w doniaJn,SH2 t PH.FuriHER2 receptor tyrosine kinase {c-erb-b2, 4,8 
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417576 
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424420 
452190 
419986 
421582 
446733 
453060 
400205 
420854 
432690 
441560 
416445 
439024 
432882 
447754 
453775 
431657 
427899 
422938 
418478 
406922 
439285 
429922 
420139 
425071 
424511 
428722 
428330 
414831 
413589 
418629 
413453 
403943 
444618 
408761 
427809 
418203 
454071 
424676 
457465 
417601 
446619 
406625 
428769 
426310 
417531 
444781 
431493 



414175 
455325 
429597 
425320 
431854 
407104 
449051 
434398 
454042 
407756 
401464 
412970 
412049 
425776 



NA 

AA447453 Hs.27860 
AA687376 Hs.269533 
AA339449 Hs.82285 
AI860021 Ks.270651 
BE614743 Hs.14668d 
H26735 Hs.91668 
AI345455 Hs.78915 
A1910275 Hs.1406 
AA863360 Hs.26040 
AW294092 Hs.21594 
NA 

AW296927 

AF181490 Hs.278627 
F13386 Hs.7888 
AL043004 Hs.79337 
R96696 Hs.35598 
NNL013257HS.279696 
AW073310 Hs.163533 
NM_002916HS,35120 
AJ345227 Hs.105448 
AA829286 Hs.332053 
NMJW1809HS.1594 
U38945 Hs.1174 
S70284 

AL133916 Hs.172572 
Z97630 Hs.226117 
NM.O05357HS.95351 
NMJ)13989Hs.154424 
BE300512 Hs.193557 
U76456 HS.190787 
L22524 Hs.2256 
M31158 Hs.77439 
AW452631 Hs.313803 
BE247550 Hs.86859 
AA129640 Hs.128065 

AV653785 Hs.173334, 
AA057264 Hs.238936 
M26380 Hs.180678 
X54942 Hs.83758 
A1041793 Hs.42502 
Y08565 Hs.151678 
AW301344 Hs.122908 
NM_014735Hs.82292 
AU076643 Hs.313 
Y13647 Hs.1 19597 
AW207175 Hs,106771 
NM_000909Hs.169266 
NNL003157HS.1087 
NM 014400HS.11950 
AI791493 Hs.129873 
AF059214 Hs.194687 
AI308876 Hs.103849 
AW895719 
NM.003816Hs.2442 
U29344 Hs.83190 
AA383550 Hs.271699 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA116021 Hs.38260 
AF039241 Hs.9028 
AB026436 Hs.177534 
N53437 Hs.18268 
U25128 Hs.159499 



SS,TM,3Beta^SD, ENSPOOQ00171555:CDNA FU10727 fis, clone 

SS.TM.7tm 1, Homo sapiens mRNA; cONA OKFZp586M0723 (f 

SS l pkinase,RhoGEF l ig.PH,SH3. ESTs 

AlRS.formylJransf.GARS.SS.GARS.AiRS.for phosphoribosyfglycinamide formyftransfer 
pWnase ESTs, Moderately similar to A47582 B<*1 

SS.TM.MAPEG, prostaglandin E synthase 

SS,WH,SH2,Furin^ike I pkinase,Recep_L Homo sapiens clone PP1498 unknown mRNA 
pkinase.OPR, GA-binding protein transcription factor, 

SS.trefoII.SS.TM.WLrecepLa.SRCR.tryps trefoil factor 1 (pS2) 
SS.TM.p450, ESTs, WeaWy similar to fatty acid omega 

hypothetical protein MGC1 5754 
NM_006265*:Homo sapiens RAD21 (S. pombe) 
gb:UW^W0^ajc-c«07-0-Ul.s1 NCI_CGAP_Su 
prenylcysteine lyase 

Homo sapiens clone 23736 mRNA sequence 
SS,pkmase, KIAA01 35 protein 

SS.TM.trypsin t \wd.ig ESTs 

pklnase,pkinase C t serum/glucocortlcoid regulated kinase-li 

pkinase. Homo sapiens cDNA FU14142 8s, done MA 

SSMA^I3_PMJdnase.PI3Ka,Pl3K„rbd.P13 replication tactor C (activator 1) 4 (37 
SS,TM,pWhase ESTs, Weakfy sfmflar to B34087 hypothefi 

SS.SAAjJroteins,ABC - membraneABC„tran l serum amyloid A1 



SS,TM.PeptkJaseJA1, 
SS,pyrjedox t SS,RibosorraLL39 



SS,TM,thiolase, 
ank.ank 

SSJM.Desaturase.SS 
SS.ig.pkinase.LRRNT.LRRCT, 
SS,TM,Iinker_histone,7tmJ 
SS,TM.p450, 

SS.T4 deiodinase t T4_defodinase, 
SS,Yj)hosphatase,BandJ1 
SS,TIMP t 



centromere protein A (17kO) 
cydin-dependent kinase inhibitor 2A (me 
gb:stearoyl-CoA desaturase [human, adipo 
hypothetical protein FU20093 
H1 histone family, member 0 
lipase, hormone-sensitive 
deiodinase, iodothyronine, type ll 
ESTs, Moderately similar to AUJ7J4UMAN A 
tissue Inhibitor of rrwtaltoproteinase 4 



SS,PerditeJrf10,SS,PeptkiaseJd1Q,herr^ metalloproteinase 7 (MMP7; uterin 
SS,cNMP Jlnding,Rlla,HMGJ)ox protein kinase. cAMP-dependent, regulato 

SS,abhydrolase ESTs, Highly similar to AF157833 1 nond 

SS,SH2,PH,SS,TM,PH.SH2,Funn-!lke,pklnas growth factor receptor-bound protein 7 



ESTs 

C5000355:gi}4503225|refINP_000765.1! cyt 
ELL-RELATEO RNA POLYMERASE il. ELONGATIO 
ESTs, Weakly similar to (defiine not ava 
lipoprotein lipase 
CDC28 protein kinase 2 
ESTs 

UDP-W-acetyl-alpha-D-gaiactosamlne:polyp 
DNA replication factor 
KlAA0215gene product 
secreted phosphoproteln 1 (osteopontin, 
stearoyi-CoA desaturase (delta-9-desatur 
ESTs 

neuropepflde Y receptor Y1 
serine/threonine kinase 2 
GPl-anchored metastasis-associated prote 
ESTs, Weakly similar to A36036 cytochrom 



SS.Peptidase_C1.gpdh 
p450,SS,p450 

SS,TM,7tmJ. 
SS,lipase,PLAT.Sec7,PH, 
CKS.SS.CKS, 
TM.7imJ, 

Glycos.transL2,Rlcin_B_lectin,SS,Giycos 
SS,Pribosyltran,Suifatase 
PHD,pkinase.SS 
SS,TM.eJband,ionJrans 
SS.TM.Desaturase.SS 
$S.7tmJ.SPRY, 
SS,TM.7tm_1. 

SS,pklnase,vwa,vwa,Giyco - transfJ5 
SS,PH,lactamase_B 
SS,p450,p450 

cholesterol 25-hydroxylase 
TM,hernopexin.Peptidase_M10^emopexin.Pephypothetical protein DKFZp761D112 
TM,ionJrans,rUetra, gb:GV4-NN0039-290300-154-f06 NN0039 Homo 

a dtslntegrin and metalloproteinase doma 
AcyLtransf,adrL2inc,kBtoacylHsyntpp4}i fatty acid synthase 
IMS.SS polymerase PNA directed) lota 

SSJM,SHiPH.pWhase.Recep.Ldc^laFuriv-erW>2 avian erythroblastic leukemia v 



SS 

pkinasePOLO_box,SS4)kinase I POLO«box ( 

SS.ig.pkinase.LRRNT.LRRCT, 

SS,UCH-1,UCH-2,SS,TM,G_glu_transpept 

Peptidase_M24, 

DSPcMyosin_taa f 

SS.adenylatekinase, 

SS.TM.7trru2,SS,TM,7tfn_2 
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HER2 receptor tyrosine kinase (c-erb-b2, 
serum-tndudble kinase 
hypothetical protein FU20093 
ubiquitin specific protease 18 
histone deacetytase 5 
dual specificity phosphatase 10 
adenylate kinase 5 
parathyroid rwrmone receptor 2 



4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.0 
4.0 
4.0 
3.9 
35 
3.9 
3.6 
3.8 
3.8 
3.8 
3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.5 
3.5 
33 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.4 



407846 AA426202 Hs.40403 TM^Cjrembrane^CJranJ*lbo^ 3.4 

406925 L34041 Hs.9739 SSJM,traiisporU>rotSWIB,Rru^ 3.4 

445873 AA250970 Hs.251946 SS.rrm.PABP,pkina3e ( 14-3-3 l rrm poly{AH)lnding protein, cytoplasmic 1-1 3,4 

418054 NM_002318Hs.83354 SS.TMjnito^caiT.LysyLoxkfase fysyi oxidase^® 2 .3.4 

5 406815 AA833930 Hs.288036 SS.tPPT, ftNA Isopentenylpyrophosphate transferas 3.4 

410530 M25809 Hs.64173 ATP^nLab,SS.7tmJATP-synLab ATPase, H transporting, lysosomal (vacuo 3.4 

407021 U52077 gb:Human marinerl transposase gene, comp 3.4 

421168 AF182277 Hs.330780 SS,p450,SS cytochrome P450, subfamily HB (phenobar 3.4 

431473 AA825686 Hs.321176 SS ESTs, Weaidy similar to S65824 reverse t 3.4 

10 408101 AW968504 Hs.123073 pldnase. C0C2-related protein kinase 7 3.4 

422083 NNL001141HS.111256 Epoxygenase.PLAT. arachidonata 15-Bpoxygenase, second typ 3.3 

411393 AW797437 Hs.69771 SS,sushi,tiypsin > vv!^mn,fibrinogen_C,<h B-factor, properdin 3.3 

435767 H73505 Hs.117874 SS,Peptldase_S8P,Pep6dase._S8,P ESTs 3.3 

433068 NMJK)6456Hs.288215 SS,Pribosyltran, sialyKransferase 3.3 

15 426928 AF037062 Hs.172914 SS^drushort.TGF4>etaJGFb_propeptide refinol dehydrogenase 5 (11-ds and 9-d 3.3 

414575 H11257 Hs.22968 SS.pWnase,lg. Homo sapiens clone IMAG&451939, mRNA se 3.3 

445941 AI267371 Hs.172636 SS.SS.lipoxygenase.PLAT ESTs 3.3 

444542 AI161293 Hs.280380 SS ( SSPeptdase_M1 f EGF l fg l lecfin_c,sushi anwiopeptidase 3.3 

425741 AF052152 Hs.159412 pkinase, Homo sapiens done 24628 mRNA sequence 3.3 

20 434228 Z42047 Hs.283978 SS,TM,7tmJ Homo sapiens PR02751 mRNA, complete cds 3.3 

433264 D85782 Hs.3229 cysteine dtoxygenase, type I 3.3 

400419 AF084545 SS,Peptidase_M1. Target 3.3 

439750 AL359053 Hs.57664 THIntegrauB,Ricln_BJeciin,i7m Homo sapiens mRNA fall length insert cDN 3.3 

417757 R19897 Hs.106604 death^U5,pkinaseActivin^recp, ESTs 3.3 

25 452194 A1694413 Hs.332649 SS.TM,7trru3^NF>receptor l sushi olfactory receptor, family 2, subfamily 3.2 

421458 Nft/L003654Hs. 104576 SS carbohydrate (keratan sulfate GaW) sul 3.2 

443767 BE562136 Hs.9736 SS^Cl,RasGEF,hormone_rec^f-C4, proteasome (prosome, macropain) 26S subu 3.2 

422648 D86983 Hs.1 18893 peroxidase.LRRCT, Melanoma associated gena 3.2 

423431 AA326062 SS.p450.p450 gb:EST29171 Cerebellum II Homo sapiens c 3.2 

30 451264 A1768235 SS.Trehalase gbwg82g08jc1 Soares_NSF_F8_9W_OT_PA_P^S 3.2 

452110 T47667 Hs.28005 SS,TM^cfivin^recp,pkinase HomosaplenscDNAFU11309fis,donePL 3.2 

439963 AW247529 Hs.6793 TM.p450.Ets platelet-activafing factor acetylhydrola 3.2 

453941 U39817 Hs.36820 SSJ3EAD^RDC,helicase„C, Bloom syndrome 3.1 

406664 L34041 Hs.9739 SS,TM.transporLprotSWIB^hoGAP,DAG^PE- gryceroW- 

35 phosphate dehydrogenase 1 (so 3.1 

453487 R31770 Hs^3540 TMJtmJ, ESTs 3.1 

420911 U77413 Hs.100293 Winked N-acetytglucosamine (GlcNAc) tr 3.1 

443171 BE281128 Hs.9030 SS.TM.7tm J. rrm.SS TONDU 3.1 

452256 AK000933 Hs.28661 TM.GDIJtrrul. Homo sapiens cDNA FU 10071 fis, done HE 3.1 

40 432201 AI538613 Hs.298241 SS.THtrypsln.SS.mtrefefl.trypsln.tref Transmembrane protease, serine 3 3.1 

419150 T29618 Hs.89640 TM.pWnase.fh3. TEK tyrosine kinase, endothelial (venous 3.1 

444443 A1149286 Hs.55099 SS rab6 GTPase activating protein (GAP and 3.1 

426283 NNL003937Hs.169139 kynurenlnase (Ucynurenlne hydrolase) 3.1 

436291 BE568452 Hs.5101 SS.abhydrolase, protein regulator of cytokinesis 1 ^3.1 

45 450223 AA418204 Hs.241493 SS.proJsomerase. nature) WUer-tumor recognition sequenc 3.1 

424269 AW137691 Hs.199754 SS ) TM,7(mJJ.GPS ESTs 3.1 

448105 AW591433 Hs.298241 SS.TM.trefoil.uypsin l trefo« Trajwmembrane protease, serine 3 3.0 

452560 BE077084 Hs.336432 SS.rrm.zf^BP.pkinase ( C2,pklnase^C,DAGESTs 3.0 
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TABLE 21A 



Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 21. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey 


CAT number Accessions 


420854 


197072J 


AW296927 AI684514 AI2631 68 AA281079 


423431 


228162J 


AA326062 AA325758 AW962182 


423945 


233566J 


AA410943 AW948953 AA334202 AA332882 


451264 


863988J 


AI768235 R31400 H29082 H23107 


455325 


1279475J 


AW895719 N31451 N41451 


456207 


165078_-1 


AA193450 



10 



Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 
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TABLE 21B 

Table 21B shows the genomic positioning for those pkeys lacking unigene ID'S and 
5 accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Ptey: Unique number comsspondk^ to an Eos probeset 

10 Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gi) numbers. "Dunham I. et a).' refers to the publication 

entitled "The ONA sequence of human chromosome 22/ Dunham I. et al„ Nature (1999) 402:469-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: indicates nucleotide positions of predicted exons. 

15 

Pkey Ref Strand NLposition 

401045 8117619 Plus 90044-90184.91111-91345 

402230 9966312 Minus 29782-29932 

20 402408 9796239 Minus 110326-110491 

402578 9884928 Pius 66350-66496 

403593 6862650 Minus 62554-62712.69449^9602 

403943 7711864 Plus 100742-100904.101322-101503 

404091 7684554 Minus 82121-63229 



320 



WO 02/059377 



# 



PCT/US02/02242 



10 



20 



TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85 th percentile value for 12 non-malignant breast specimens, and the 
96 th percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcjenelD: Unigene number 

1 5 Unigene Title: Unigene gene title 

R1: Ratio of 90* percentile tumor to 85* percentile normal breast tissue 



Pkey ExAccn Unigene© UnigeneTitle R1 



400292 AA250737 Hs.72472 BMP-R1B 51.5 

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 38.3 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401278 29.9 

431448 AL137517 Hs.334473 hypothetical protein DKFZp56401278 265 

25 451110 Ai955040 Hs.265398 ESTs, WeaJdy similar to tfartforniatforK 25.8 

431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 232 

418203 X54942 Hs.83758 CDC28 protein kinase 2 22.6 

407980 M046309 gb:zf12fOU1 Scares JetaUeaOlbHH19W 19.8 

414646 AA353776 Hs.901 CD48 antigen (B-ceB membrane protein) 18.9 

30 446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0 

409041 AB033025 Hs.50081 K1AA1199 protein 17.6 

412140 AA219691 Hs.73625 RAB6 interacting, Wneslrvfike (rabklnes 17.6 

407824 AA147884 Hs.9812 Homo sapiens cDNA FU14388 fis, done HE 17.1 

453160 A1263307 Hs.239884 H2B histone family, member L 17.0 

35 407137 T97307 gb:ye53h05.s1 Soares fetal Ever spleen 16.1 

425692 D90041 Hs.155956 N-acetyitransferase 1 (arylamlne N-acety 16-1 

438533 A1440266 Hs.170673 ESTs, Weakly similar to T24832 hypothetj 16.0 

428227 AA321649 Hs.2248 smaD inducible cytokine subfamily B (CX 155 

444342 NMJ)14398Hs.10887 similar to tysosomenassociated membrane 15.1 

40 422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.9 

430515 AA746503 Hs.283313 ESTs 14.7 

417308 H60720 Hs.81892 KIAA01 01 gene product 14.4 

452744 A1267652 Hs.30504 Homo sapiens mRNA; cDNA OKFZp434E082 (fr14.4 

412446 A1768015 Hs.92127 ESTs 145 

45 415539 AI733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs.265398 ESTs, Weakly similar to transformatton-r 13.8 

438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8 

400205 NA NM_006265 # :Homo sapiens RAD21 (S. pombe)13.5 

430965 AA489732 Hs.154918 ESTs 13.4 

50 415263 AA948033 Hs,130853 ESTs 13.3 

451952 AL120173 Hs.301663 ESTs 13.2 

449722 BE280074 Hs.23960 cyclinBV t 135 

406685 M18728 gbfluman nonspecific crossreacting antig 13.0 

406690 M29540 Hs.220529 carcinoembryonic anfegefwelated ceB ad 12.8 

5 5 429925 N&L000786HS.226213 cytochrome P450, 51 {lanosterol 14-alpha 12.8 

416498 U33632 Hs.79351 potassium channel subfamily K, member 1 127 

432378 A1493046 Hs.146133 ESTs 12.5 

441377 BE218239 Hs.202656 ESTs 125 

456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi12.4 

60 422805 AA436989 Hs.121017 H2A hbtone family, member A 12.2 

407811 AW190902 Hs.40098 cysteine knot superfamlly 1 , BMP antagon 12.2 

407178 AA195651 Hs.104106 ESTs 125 
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420931 AF044197 Hs.100431 small Inducible cytokine B subfamily (Cy 12.1 

421727 Y13153 Hs.107318 kynurenhe3^nonooxygenase(kynun3nlne3 12.0 

434408 AI031771 Hs.132586 ESTs 12.0 

446591 H44186 Hs.15456 P02 domain containing 1 11.9 

5 431385 BE178536 Hs. 11090 membrane-spanning 4-domains, subfamily A 11.8 

443348 AW873598 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 11.7 

416602 NM_006159Hs.79389 nel (chickenHike 2 11.7 

433365 AF026944 Hs.293797 ESTs 11.6 

437866 AA156781 Hs.74170 metaHothionein 1E (functional) 115 

10 412472 AW975398 Hs.293836 ESTs 114 

416030 H15261 Hs,21948 ESTs 11.3 

439979 AW600291 Hs.6823 hypothetical protein FU 10430 1U 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 11.3 

411598 BE336654 Hs.70937 H3 histone family, member A 11.2 

IS 423600 AI633559 Hs.310359 ESTs 112 

430770 AA765694 Hs.123296 ESTs 110 

421037 AI684808 Hs.197653 programmed ceil death 9 (PDCD9) 105 

452461 N78223 Hs.108106 transcription factor 10.7 

409269 AA576953 Hs.22972 hypothetical protein FU13352 10.6 

20 417791 AW965339 Hs.111471 ESTs 10.6 

447268 AI370413 Hs.36563 hypomefical protein FU22418 10.4 

424001 W67883 Hs,137476 paternally expressed 10 10.4 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4 

424905 NM 002497Hs. 153704 NIMA (never In mitosis gene a)-re(ated k 10.1 

25 453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA. complO.1 

442942 AW167087 Hs.131562 ESTs 10.1 

434377 AW137148 Hs.306593 Homo sapiens cDNA FU1 1382 lis, clone HE 10.1 

427217 AA399272 Hs.144341 ESTs 10.1 

445730 AI624342 Hs.170042 ESTs 10.0 

30 432887 AI926047 Hs.162859 ESTs 10.0 

452243 AL355715 Hs.28555 programmed cell death 9 9.9 

424590 AW966399 Hs.46821 hypothetical protein FU20Q86 9.9 

432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9 

438950 H23789 Hs.144530 EST 9.9 

35 418836 AI655499 Hs.161712 ESTs 9.8 

430291 AV660345 Hs.238126 CGI-49 protein 9.8 

444665 BE613126 Hs.47783 B aggressive lymphoma gene 9.7 

407377 C16391 gb:C16391 Clontech human aorta polyA mRN 97 

445413 AA151342 Hs.12677 CGI-147 protein 9.7 

40 443462 AI064690 Hs.171176 ESTs 9.7 

442145 AI022650 Hs.8117 erbb2-lnterac8ng protein ERB1N 9.7 

435570 AF212222 Hs.177812 uncharaderized bone marrow protein BM04 9.7 

439820 AL360204 Hs.283853 Homo sapiens mRNA fuO length Insert cDN 9.6 

428966 AF059214 Hs.194687 cholesterol 25-hydroxyiase 9.6 

45 449448 D60730 Hs.57471 ESTs 9.6 

433929 AI375499 Hs.27379 ESTs 9.5 

432731 R31 178 Hs.287820 fibronecflh 1 9.3 

411815 AA156679 Hs.125790 leudne-rtch repeat-containing 2 9.3 

415385 R17798 Hs.7535 COBW-tlke protein 9.3 

50 422026 U80736 Hs.110826 trinucleotide repeat containing 9 9.2 

432596 AJ224741 Hs.278461 matriBn3 9.2 

439451 AF086270 Hs.278554 heterochrwrattn-fite protein 1 95 

423945 AA410943 gb:zt32h03j1 Soares ovary tumor NbHOT H 9.1 

442432 BE093589 Hs.38178 hypothetical protein FU23468 9.1 

55 446715 A1337735 Hs. 173919 ESTs, Moderately similar to ZN91JWMANZ 9.0 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0 

437021 AI076089 Hs.292239 ESTs 9.0 

428479 Y00272 Hs.184572 cell division cycte 2, G1 to S and G2 to 8.9 

428839 AI767756 Hs.82302 Homo sapiens CDNA FU14814 fe, done NT 8.9 

60 402408 NA NM_030920*:Homo sapiens hypothetical pro 8.9 

418601 AA279490 Hs.86368 calrnegin 8.8 

426327 W03242 Hs.44898 Homo sapiens done TCCCTA00151 mRNA sequ8.8 

419519 AI198719 Hs.176376 ESTs 8.8 

440621 AW296024 Hs.150434 ESTs 8.8 

65 446142 A1754693 Hs.145968 ESTs 8.8 

418196 AI745649 Hs.26549 K1AA1708 protein 8.7 

447178 AW594641 Hs.192417 ESTs 8.7 
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427585 
415657 
435061 
431374 
5 417866 
437211 
437751 
423887 
440941 
10 429859 
410193 
431725 
446258 
416747 
434424 
421650 
429534 
457465 
427961 
436481 
418216 
418250 
400285 
401464 
407242 
422232 
454024 
444542 
436396 
437204 
408805 
437207 
442818 
426283 
424687 
446315 
433426 
406639 
420077 
457332 
422938 
447555 
444618 
410361 
400268 
439509 
407771 
407202 
433096 
422094 
430832 
430287 
423739 
448212 
407277 
454440 
444783 
421373 
431960 
424704 
449517 
439840 
414080 
441243 
408380 
422956 
446651 
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45 
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65 



D31152 Hs.179729 
AA866115 Hs.127797 
AI651474 Hs.163944 
BE258532 Hs.251871 
AW067903 Hs.82772 
AA382207 Hs,5509 
AA767373 Hs.35669 
AL080207 Hs.134585 
BE268362 Hs.7535 
NM.007050HS.225952 
AJ132592 Hs.59757 
X65724 Hs.2839 
AI283476 Hs.263478 
AW876523 Hs.15929 
AI811202 Hs.325335 
AA781795 Hs.122587 
AW976987 Hs.163327 
AW301344 Hs.122908 
AW293165 Hs.143134 
AA379597 Hs.5199 
AA662240 Hs.283099 
U29926 Hs.83918 
NA 

AF039241 
M18728 
043945 

AA993527 Hs.293907 

AI161293 Hs.280380 

AI683487 Hs.152213 

AL110216 Hs.12285 



Hs.9028 
Hs.113274 



H69912 
T27503 



Hs.48269 
Hs.15929 



collagen, type X, alpha 1 (Schmid metaph 8.6 
Homo sapiens cDNA FU1 1381 fis, clone HE 8.6 
ESTs 8.6 
CTP synthase 8.4 
collagen, type XI, alpha 1 8.4 
ecotroplc viral integration sits 2B 8.3 
ESTs, Moderately similar to ALU1_HUMAN A 8.3 
DKFZP4346232 protein 8.2 
COBW-like protein 8.2 
protein tyrosine phosphatase, receptor t 8.2 
zinc finger protein 281 8.2 
Nome disease (pseudogBoma) 8.1 
ESTs 8.1 
hypothetical protein FU12910 8.1 
Homo sapiens cONA: FU23523 fis, done I 8.1 
ESTs 8.0 
ESTs, Weakly similar to 2109260A B ceH 8.0 
DNA replication factor 8.0 
ESTs 8.0 
HSPC150 protein similar to ubiquitin-con 8.0 
AF15q14proteh 8.0 
adenosine monophosphate deaminase (bora 7.9 
Eos Control 7.9 
histonedeacetyiaseS 7.9 
gb:Human nonspecific crossreacting antig 7.8 
transcription factor EC 7.8 
hypothetical protein FU23403 7.8 

7.8 
7.7 
7.6 
7.6 
7.6 
7.6 
7.5 
7.5 
7.5 
7.5 
7.5 
7.4 
7.4 
7.4 



wingless-type MMTV integration site (ami 
ESTs, Weakly similar to (55214 salivary 
vaccinia related kinase 1 
hypothetical protein FU12910 
AK001741 Hs.8739 hypothetical protein FU10879 
NfVL003937Hs.169139 kynureninase (Ucynurenine hydrolase) 
J05070 Hs.151738 matrix metailoprotelnase 9 (gelatinase 8 
NMJ>16293Hs.14770 bridging integrator 2 
H69125 Hs.133525 ESTs 

M97711 gfcHumanT-cell receptor (V beta 18.1, J 

AW512260 Hs.87767 ESTs 
AA961694 Hs.105187 Wnesin protein 9 gene 
NMJ)01809Hs.1594 centromere protein A (17kD) 
AI391662 Hs.160963 Homo sapiens, clone MGC:1231 8, mRNA,oom7.4 
AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATI07.3 
BE391804 Hs.62661 guanylate binding protein 1, interferon- 7,3 
NA NM_003292;Homo sapiens translocated prom 7.3 

AF086332 Hs.58314 ESTs 
AL138272 Hs.62713 ESTs 
N58172 Hs.109370 ESTs 
AU076803 Hs.282975 carboxylesterase 2 (intestine, Bver) 
AF129535 Hs.272027 F-box only protein 5 
AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 
AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE7.0 
AA398155 Hs.97600 ESTs 7.0 
A1475858 gb:tc87d07j(1 NCLCGAP_CLL1 Homo sapien$7.0 

AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 
BE062906 Hs,28338 KIAA1546 protein 
AK001468 Hs.62180 aniffin (Drosophiia Scraps homolog), act 
AA808229 Hs.167771 ESTs 
AW241821 Hs.301927 c6.1A 

AI263293 Hs.152096 cytochrome P450, subfamily IIJ (arachldo 
AW500106 Hs.23643 serine/threonine protein kinase MASK 
AW449211 Hs.105445 GDNF family receptor alpha 1 
AA135257 Hs.47783 B aggressive lymphoma gene 
AI767056 Hs.193002 ESTs 
AF123050 Hs.44532 diubiquitin 
BE545072 Hs.122579 hypothetical protein FU10461 
AA393907 Hs.97179 ESTs 6.7 



7.3 
7.3 
7.3 
7.2 
7.1 
7.1 



7.0 
7.0 
6.9 
6.9 
6.8 
6.8 
6.8 
6.8 
6.7 
6.7 
6.7 



323 




. WO 02/059377 PCT/US02/02242 



419839 U24577 Hs.93304 phosphoOpase A2, group VII (pbtetet-ac 6.7 

437740 AA810265 Hs.122915 ESTs 67 

421582 AI910275 Hs.1406 trefcifl factor 1 (pS2) 6.7 

427356 AW023482 Hs.97849 ESTs 6.6 

5 429597 NM_003816Hs.2442 a dfeintegrin and metaOoproteinase doma 6.6 

422634 NMJH6010HS.118821 CGI-62 protein &6 

421072 AI215069 Hs.89113 ESTs 6.5 

427718 AI798680 Hs25933 ESTs 6.5 

411000 N40449 Hs301619 ESTs, Weakly similar to S38383 SEB4B pro 6.5 

10 449343 A1151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.4 

409757 NM_Q01898Hs.123114 cysfafinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRMA, partial sequence 6.4 

456938 X52509 Ks.161640 tyrosine aminotransferase 6.4 

418848 AI820961 Hs.193465 ESTs 6.4 



15 424902 NMJ03866HS.153687 Inositol polyphosphaM-phosphatase, ty 6.4 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4 

439452 AA918317 Hs.57987 B<el! CLL/r/mphoma 118 (zinc finger pro 6.4 

407266 AJ235664 gbJtomo sapiens mRNA for tmmunoglobolln 6.3 



411078 AI222020 Hs.182364 CocOaCrisp 6.3 

20 433001 AF217513 Hs.279905 done HQ0310 PRO0310p1 6.3 
434340 AI193043 Hs.128685 ESTs, WeaWy similar to T17226 hypothetl .6.2 

429503 AA394183 Hs.26873 ESTs 6.2 

402578 C1001134:gl|21 17372|pir||l65981 fatty ac 6.2 

409646 AW161391 Hs.709 deoxycytkJine kinase 6.1 

25 430447 W17064 Hs.332848 SWVSNF related, matrix associated, adi 6.1 

432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6,1 

443709 AI082692 Hs.134662 ESTs 6.1 

420929 AI694143 Hs.296251 programmed ceU death 4 6.1 

428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysln 6.1 

30 428248 AI126772 Hs.40479 ESTs 6,0 

420344 8E463721 Hs.97101 putative G protein-coupled receptor 6.0 

453392 U23752 Hs.32964 SRY (sex determining region Y>box 1 1 6.0 

425397 JQ4088 Hs.156346 topobomerase (DNA) II alpha (170kD) 6.0 

418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP 1 ; inters 6.0 

35 428585 AB007863 Hs, 1 85140 K1AA0403 protein 6.0 
437608 AA761605 Hs.292308 ESTs, Weakly similar to ALU1JHUMAN ALU S6.0 

427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0 

406687 M31126 Hs.272620 matrix metalloproteinase 11 (MMP11;stro 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

40 447051 AW139130 Hs.160951 ESTs. WeaJdysirnflartoConl [H^aplens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metalloproteinase 13 (coOagenase 6.0 

435106 AA100847 Hs.193380 ESTs, Highly similar to AF174600 1 F-box 5.9 

435525 AI831297 Hs.123310 ESTs 5.9 

45 458809 AW972512 Hs.20985 sin3-assoclated polypeptide, 30kD 5.9 
410785 AW803341 gb:ll^4JM0079^90\W0-050-O03 UM0079 Homo5.9 

422576 BE548555 Hs.1 18554 CGI-83 protein 5.9 

451398 AI793124 Hs.144479 ESTs 5.9 

441881 AW968904 Hs.179566 hypothetical protein FU22624 5.8 

50 412022 AI005043 Hs.24143 Wiskott-Aldrich syndrome protein interac 5.8 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxyGc 5.8 

447350 A1375572 Hs.172634 ESTs 5.8 

434094 AA305599 Hs.238205 hypothetical protein PRO2013 5.8 

409151 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. c 5.8 

55 448807 AI571940 Hs.7549 ESTs 5.8 
452281 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 tis, clone PL 5.8 

421281 AI299139 Hs.17517 ESTs 5.8 

430361 AI033965 Hs.239926 sterol-C4-methyl oxidase-Bke 5.8 

400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10;str 5.7 

60 440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.169300 trareforming growft factor, beta 2 5.7 

452401 NMJJ07115HS.29352 tumor necrosis factor, alpha-induced pro 5,7 

448663 BE614599 Hs.106823 hypoftefieal protein MGC14797 5.7 

65 438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.183752 microserranoprotein, beta- 5.6 
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430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6 
422835 BE218705 Hs.121378 metettotWoneln-like 5, testis-speciflc 5.6 
444758 AL044878 Hs.11899 34iydroxy4-mettytgfutaryl-Coenzyme A re 5.6 
443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6 
5 400301 X03635 Hs.1657 estrogen receptor 1 5.6 
447078 AW885727 Hs.301570 ESTs 5.6 
432015 AL157504 Hs.159115 Homo sapiens mRNA; cONA OKFZp586O0724 (f5jS 
438691 AA906288 Hs.212184 ESTs 5.5 
439809 R41396 Hs.101774 hypothetical protein FU23Q45 5.5 

10 415786 AW419196 Hs.257924 hypothetical protein FU 13782 55 
455373 BE247706 Hs.89751 membraiie^panning4-<Jornains 1 subJamilyA 5.5 
401645 NA C16001440 # gi|12330704]gb|AAG52890.1iAF355 
437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5.5 
445885 AI734009 Hs.127699 WAA1603 protein 5.4 

15 439138 AI742605 Hs.193698 ESTs 5.4 
440270 NH.015986Hs.7120 cytokine receptor-like molecule 9 5.4 
437536 X91221 Hs,144465 ESTs 5.4 
438167 R28363 Hs.24286 ESTs 5.4 
452741 BE392914 Hs.30503 -Homo sapiens cDNA FU11344 fis, done PL 5.4 

20 426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7.HUMAN A 5.4 
413554 AA319146 Hs.75426 secretogranin II (dtromogranin C) 5.4 
422867 132137 Hs.1584 cartilage oligomerlc matrix protein (COM 5.4 
434263 N34895 Hs.44646 ESTs 5.4 
446382 AW205168 Hs.150823 ESTs 5.4 

25 422406 AF025441 Hs.116206 Opa-interacting protein 5 5.3 
438321 AA576635 Hs.6153 CG1-48 protein 5.3 
418310 AA814100 Hs.86693 ESTs 5.3 
419625 U91616 Hs.91640 nudear factor of kappa tight polypeptid 5.3 
450701 H39960 Hs.288467 Homo sapiens cDNA RJ 12280 fls. clone MA 5.3 

30 445900 AF070526 Hs.13429 Homo sapiens done 24787 mRNA sequence 52 
449051 AW961400 Hs.333526 HER2 receptor tyrosine Kinase (c-erb-b2, 5.2 
418758 AW959311 Hs.172012 hypothetical proton OKFZp434J037 5.2 
431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2 
417079 U65590 Hs.81134 interleukin 1 receptor antagonist 5.2 

35 421928 AF013758 Hs.109643 polyadenylate binding protetn-interactin 5.2 
428804 AK000713 Hs.193736 hypothetical protein FU20706 52 
427427 AF077345 Hs.177936 ESTs 5,2 
403485 C3001813*:giI12737279lrerTXPj012163.1I k 5.2 

422168 AA586894 Hs.1 12408 S100 calcium-binding protein A7 (psorias 5.1 

40 421937 AI878857 Hs,109706 hematological and neurological expressed 5.1 
426752 X69490 Hs.172004 titin 5.1 
453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.1 
423198 M81933 Hs.1634 ce0 division cycle 25A 5.1 
412281 AI810054 Hs.14119 ESTs 5.1 

45 447513 AW955776 Hs.313500 ESTs. Moderately similar to ALU7_>IUMANA 5.1 
453931 AL121278 H&25144 ESTs 5.1 
404347 TajgetExon 5.1 

431808 M30703 Hs.270833 amphlregulin (schwannoma-derived growth 5.1 
429113 D28235 Hs.196384 prostaglandln-endoperoxide synthase 2 (p 5.1 

50 436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1 
450603 R43646 Hs.12422 ESTs 5.1 
434725 AK000796 Hs.4104 hypothetical protein 5.0 
435981 H74319 Hs.188620 ESTs 5.0 
407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF_HUMAN till 5.0 

55 431689 AA305688 Hs.267695 UDP-GafcbetaGicNAc beta 1,3-galactosyltr 5.0 
405348 NA C7001664:gl|12698061 |dbj}BAB21 849.1 |(AB 5,0 

436196 AK001084 Hs.333498 Homo sapiens cDNA RJ10222 fls. done HE 5.0 
437065 AL036450 Hs.103238 ESTs 5.0 
410196 A1936442 Hs.59838 hypothetical protein FU 10808 5.0 

60 429412 NMJW6235HS.2407 POU domain, dass 2, associating factor 5.0 
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4,9 
403329 NA TajgetExon 4,9 

442875 BE623003 Hs.23625 Homo sapiens done TCCCTA00142 mRNA sequ45 
442441 AI820662 Hs.129598 ESTs 4.9 

65 430375 AW371048 Hs.93758 H4 hlstone famDy, member H 4.9 



424128 AW966163 gb:EST378236 MAGE resequences, MAGI Homo4.9 

408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 4.9 
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407910 AA650274 Hs.41296 fibronecrjn leucine rich transmembrane p 4.9 

432606 NM_002104Hs.3Q66 granzymeK (serine protease, granzyme 3; 4.9 

453204 R10799 Hs.191990 ESTs .4.8 
452020 AA722012 Hs.255767 ESTs, Weakly similar to AT2A.HUMAN POTEN4.8 

5 449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose Indue 4.8 

408369 R38438 Hs.182575 solute carrier family 15 (H77? transport 4.8 

431645 AF078849 Hs.266483 dynein Oght chains 4.8 
423575 C18863 Hs.163443 Homo sapiens cONA FU1 1576 lis, clone HE 4.8 

444246 H93281 Hs. 10710 hypoihefical protein FU20417 4.8 

1 0 421524 AA312082 Hs.105445 GDNF family receptor alpha 1 4 A 

452827 AI571835 Hs.55468 ESTs 4.8 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8 

456086 AL161999 Hs.77324 eukaryotic translation termination facto 4.8 

419078 M93119 Hs.89584 insulinoma-assoclated 1 4.8 

15 418973 AA233056 Hs.191518 ESTs 4.8 

447033 AI357412 Ha.157601 ESTs 4.8 

451621 AI879148 Hs.26770 fatty add binding protein 7, brain 4.7 

419966 X04430 Hs.93913 In terieukin 6 (interferon, beta 2} 4.7 

424326 NMJ14479HS.14529S drsintegrin protease 4.7 

20 431585 BE242803 Hs.262823 hypothetical protein RJ10326 4.7 

429294 AA095971 Hs.198793 Homo sapiens cDNA: FU22463 lis, done H 47 

416814 AW192307 Hs.80042 doiichyl-P-Glc:Man9GlcNAc2-PP-doIichylgl 4.7 

439897 NM.015310Hs.6763 K1AA0942 protein 4.7 

429687 AI675749 Hs.211608 nudeoporin 153kD 4.7 

25 422880 AF228704 Hs.121524 glutalhlone reductase 47 
405801 NM-000390:Homo sapiens choroideremia (Ra4.6 

432435 BE21B886 Hs.282070 ESTs 4.6 

439544 W26354 Hs.28891 hypothetical protein FU11360; artemls p 4.6 

425354 U62027 Hs.155935 complement component 3a receptor 1 4.6 

30 436027 AI864053 Hs.39972 ESTs, WeaMy similar to I38588 reverse t 4.6 

• 424623 AW963062 Hs.337404 ESTs 4.6 

403366 NA Target Exon 4,6 

402542 Target Exon 4.6 

450193 A1916071 Hs. 15607 Homo sapiens Fanconi anemia complementat4.6 

35 411678 AI907114 Hs.71465 squalene epoxidase 4.6 

456844 AI264155 Hs.152981 CDPKliacylglycerol synthase (phosphatida 4.6 

448072 AI459306 Hs.24908 ESTs 4.5 

408045 AW138959 Hs.245123 ESTs 4.5 

423782 AM72209 Hs.323117 ESTs 4.5 

40 447388 AW630534 Hs.76277 Homo sapiens, ctone MGC:9381, mRNA, comp4.5 

448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5 

452561 AI692181 Hs.49169 KIAA1 634 protein 4.5 
425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo4,5 

428801 AW277121 Hs.254881 ESTs 4.5 

45 428500 AI815395 Hs. 184641 fatty acid desaturase 2 4.5 

426075 AW513691 Hs.270149 ESTs, Weakly similar to 21 09260AB cell 4.4 

437259 AI377755 Hs.120695 ESTs 4.4 

400409 AF153341 Hs.283954 Homo sapiens'winged heOx/forkhead trans 4.4 

412863 AA121673 Hs.59757 zinc finger protein 281 4.4 

50 426989 AI815206 Hs.99395 ESTs 4.4 

401866 Target Exon 4.4 

418819 AA228776 Hs.191721 ESTs 4.4 

406348 Target Exon 4.4 

412138 AW895387 gb:QV4-NN003W00300-157-c10 NN0038 Homo4.4 

55 428550 AW297880 Hs.98661 ESTs 4.4 
411743 AW862214 gb:QV4-CTO361-301299-O74-W5 CT0361 Homo4.4 

429966 BE081342 Hs283037 HSPC039 protein 4.4 

423291 NM.004129HS.126590 guanylate cyclase 1, soluble, beta 2 4.4 

423456 AL110151 Hs.128797 DKFZP586D0824 protein 4.4 

60 452190 H26735 Hs.91668 Homo sapiens done PP1498 unknown mRNA4.4 

424871 NM.004525HS.153595 low density Gpoproteirweiated protein 4 J 

429575 AA706003 Hs.99387 ESTs 4.3 

429922 Z97630 Hs.2261 17 H1 hlstone family, member 0 4.3 

421379 Y15221 Hs.103982 small Inducible cytokine subfamily B (Cy 4.3 

65 400300 X03363 HER2 receptor tyrosine kinase (c-erb-b^ 4.3 

437258 AL041243 Hs.174104 ESTs 4.3 
446595 T57448 Hs. 15467 hypothetical protein FU20725 4.3 
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403011 NA ENSP0QQ00215330^Probableserine/threoni 4.3 

419055 AI365384 Hs.11571 Homo sapiens cDNAFLM 1570 fis, clone HE 4.3 

418661 NMJ)01949Hs,1189 E2F transcripfion factor 3 4.3 

407786 AA687538 Hs.38972 tetraspan 1 4.3 

5 429183 AB014604 Hs.197955 KIAA0704 protein 4.3 

442914 AW188551 Hs.99519 hypofoefcal protein FU 14007 4.3 

441029 AJ091795 Hs.179246 ESTs 4.3 

452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3 

414821 M63835 Hs.77424 Fc fragment of IgG, high affinity la, re 4.2 

10 410102 AW248508 Hs.279727 Homo sapiens cDNA FU14035 fis, clone HE 4.2 

452110 T47667 Hs.28005 Homo sapiens cONAFUl 1309 fis, done PI 4.2 

442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2 

417318 AW953937 Hs.12891 ESTs 4.2 

431818 AW510444 Hs.191705 ESTs, Weakly similar to T47184 hypothetl 42 

15 443646 A1085198 Hs.164226 ESTs 4.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheii 4.2 

446839 BE091926 Hs.16244 mitotic spindle coiled-coll related prot 4.2 

423242 AL039402 Hs.125783 DEME-6 protein 42 

432116 AA902953 Hs.308538 ESTs 4.2 

20 409038 T97490 Hs.50002 small inducible cytokine subfamily A {Cy 4.2 

445625 BE246743 Hs.288529 hypothetical protein FU22635 42 

425139 AW630488 Hs.325820 protease, serine. 23 4.2 

447397 BE247676 Hs.18442 E-1 enzyme 4.2 

410166 AK001376 Hs.59346 hypothetical protein FU10514 4.1 

25 437295 AW779318 Hs.88417 ESTs 4.1 

430486 BE062109 Hs.241551 chloride channel, caldum activated, fam 4.1 

441790 AW294909 Hs.132208 ESTs 4.1 

410129 BE244074 Hs.58831 regulator of Fas-induced apoptosls 4.1 

427521 AW973352 Hs.290585 ESTs 4.1 

30 425247 NMJ05940HS.155324 matrix metalloprotelnase 11 (MMP11;stro 4.1 

412886 AF041163 Hs.74647 Human T-cel receptor active alpha-chain 4.1 

441 153 BE562826 gb:601 336534F1 NlHJAGCjW Homo sapiens c4.1 

444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1 

426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 4.1 

35 405850 NA TaraetExon 4.1 

440283 A1732892 Hs.190489 ESTs 4.0 

432441 AW292425 Hs.163484 ESTs 4.0 

400284 NA estrogen receptor 1 4.0 

417341 N91453 Hs.102987 ESTs 4.0 

40 429732 U20158 Hs.2488 lymphocyte cytosofic protein 2 (SH2 doma 4.0 

411393 AW797437 Hs.69771 B-factor, properdin 4.0 

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0 

419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0 

419092 J05581 Hs.89603 mucin 1, transmembrane 4.0 

45 443147 AI034351 Hs.19030 ESTs 4.0 

408633 AW963372 Hs.46677 PRO2000 protein 4.0 

433404 T32982 Hs.102720 ESTs 4.0 

421506 BE302796 Hs.105097 thymidine kinase 1, soluble 4.0 

417900 BE250127 Hs.82906 COC20 (cell division cycle 20, S. cerevi 3.9 

50 414602 AW630088 Hs.76550 Homo sapiens mRNA; cONA OKFZp564B1264 (f3.9 

413762 AW411479 Hs.848 FK506-binding protein 4 (59kD) 3.9 

404580 NM_014112*:Homo sapiens trichorhlnophala 3.9 

452046 AB018345 Hs.27657 K1AA0802 protein 3.9 

459587 AA031956 gb:zk15e04.s1 Soaresj)regnanLuterus„NbH3.9 

55 416658 U03272 Hs.79432 fibrifiin 2 (congenital contractural ara 3.9 

426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9 

429353 AL117406 Hs-200102 ATP-btading cassette transporter MRP8 3.9 

419038 AW134924 Hs.190325 ESTs 3.9 

418918 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 3.9 

60 421977 W94197 Hs.1 10165 ribosomal protein L26 homotog 3.9 

442567 AI201183 Hs.130251 ESTs 3.9 

421168 AF182277 Hs.330780 cytochrome P450, subfamily (IB (phenobar 3.9 

431701 AW935490 Hs. 14658 Human chromosome 5q 13.1 done 5G8mRNA3.9 

418526 8E019020 Hs.85838 solute earner family 16 (monocarboxylic 3.9 

65 414998 NM_002543Hs.77729 oxidised low density lipoprotein (lectin 3.9 

422790 AA809875 Hs.25933 ESTs 3.9 

419741 NM.007019HS.93002 ubkptin carrier protein E2-C 3.9 
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430017 AA263172 Hs.35 protein tyrosine phosphatase, noiwecept 3.9 

458814 AI498957 Hs. 170861 ESTs, WeaWy similar to Z195HUMAN ZINC 3.8 

428514 AW236861 Hs.193139 ESTs 3.8 

434521 NM.002267Hs.3886 karyopherln alpha 3 (importln alpha 4) 3.8 

5 409425 U40462 Hs.54452 zinc finger protein, subfamily 1 A, 1 (Ik 3.8 

439560 BE565647 Hs.74899 hypothetical protein FU 12820 3.8 

424028 AF055084 Hs.153692 Homo sapiens cONA FU14354 fis, done Y7 3.8 

400021 AFFX control - HUM1SGF3A/M97935_MA 3.8 

453403 8E466639 Hs.61779 Homo sapiens cDNARJ 13591 fis, done PL 3.8 

10 445941 AI267371 Hs.172636 ESTs 3.8 

434378 AA631739 Hs.335440 EST 3.8 

429220 AW207206 Hs.136319 ESTs 3.8 

439176 A1446444 Hs.190394 ESTs,WeaWy similar to B28096Iine-1 pr 3.8 

401045 C11001883 4 :gi(6753278|ref|NP_033938.1|c 3.8 

15 430178 AW449612 Hs.152475 ESTs 3.8 

423397 NH.001838Hs.1652 chemokine (C-C motif) receptor 7 3.8 

447630 AI660149 Hs.44865 lymphoid enhancer-binding factor 1 3.8 

436391 AJ227B92 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.821 blglycan 3.8 

20 422121 AI767949 Hs.179833 ESTs 3.8 

452268 NM_003512Hs.28777 H2A histone family, member L 3.8 

427811 M81057 Hs.1 80884 carboxypepfldase B1 (tissue) 3.8 

415679 AA165232 Hs.222069 ESTs 3.8 

437330 AL353944 Hs.50115 Homo sapiens mRNA;cDNADKF2p761J11 12 (f 3.8 

25 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 3.7 

400286 NA C1600092^iI7499103|plrtIT20903hypoflie 3.7 

420281 AI623693 Hs.191533 ESTs 3.7 . 

419926 AW900992 Hs.93796 DKFZP586D2223 protein 3.7 

417541 AI992191 Hs.1B0040 hypothetical protein FU22439 3.7 

30 426172 AA371307 Hs.125056 ESTs 3.7 

429638 AI916662 Hs.21 1577 kinectin 1 (Idnesln receptor) 3.7 

457001 J03258 Hs.2062 vitamin D (1,25- dihydroxyvitamin D3) re 3.7 



424109 AW406878 gb^JI-HF*BL0-adg^-06-(UJI.r1 NIHJA6CL373.7 



417022 NM_014737Hs.80905 Rasa5Sodatfon(^GOS/AF^)d()rnainfam 3.7 

35 .436222 AI208737 Hs.122810 Homo sapiens cONA RJ 11489 fis, clone HE 3.7 

430448 AI633553 Hs.13303 Homo sapiens cONA: FU21784 fis, clone H 3.7 

432729 AK000292 Hs.278732 hypothetical protein FU20285 3.7 

413916 N49813 Hs.75615 apolipoprotein C-ll 3.7 

421662 NMJ>14141Hs.106552 cefl recognition molecule Caspr2 3.7 

40 441633 AW958544 Hs. 112242 normal mucosa of esophagus specific 1 3.7 

408761 AA057264 Hs.238936 ESTs,Weaidy simiiartD(defrmenotava 3.7 

406153 Target Exon 3.7 

445563 AW873606 Hs.149006 ESTs 3.7 

453464 AI884911 Hs.32989 receptor (calcitonin) acfivlty modifying 3.7 

45 448918 AB011152 Hs.22572 KIAA0580 protein 3.7 

413936 AP1 13676 Hs.297681 serine (or cysteine) prate toe hhiblto 3.6 

448069 U76248 Hs.20191 seven in absentia (DrosophOa) homolog 2 3.6 

453313 BE005771 Hs.153746 hypothetical protein FU22490 3.6 

425234 AW152225 Hs.165909 ESTs, Weakly similar to 138022 hypotheti 3.6 

50 419941 X98654 Hs.93837 phosphatklyCnositDi transfer protein, m 3.6 

402397 AF188625 Hs.169507 phosphofipase A2, group (ID 3.6 

430378 229572 Hs.2556 tumor necrosis factor receptor superfaml 3.6 

448106 AI800470 Hs.171941 ESTs 3.6 

426431 NM.000458HS.169853 transcription factor 2, hepatic; LF-83; 3.6 

55 431843 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 3.6 

426878 BE069341 gb^BT0381-27010(W)73^BT0381 Homo3.6 

434061 AW024973 Hs^83675 NPD009 protein 3.6 

445292 AV653264 Hs.13982 Homo sapiens cONA RJ14666 fts, done NT 3.6 

452101 T60298 Hs. 10844 Homo sapiens cONA FU14476 fis, clone MA 3.6 

60 427581 NfO14788Hs.179703 WAA0129 gene product 3.6 

409047 AW961434 Hs.31539 ESTs 3.6 

416820 NM.O0O402HS.802O6 gtucose-6-phosphate dehydrogenase 3.6 

410386 W26187 Hs.3327 Homo sapiens cONA: FU22219 fis, clone H 3.6 

440516 S42303 Hs.161 cadherln 2, type I.Nodherin {neuronal 3.6 

65 434360 AW015415 Hs. 127760 ESTs 3.6 

428970 BE276891 Hs.194691 retlnotc add induced 3 3.6 

415079 R43179 Hs.22895 hypothetical protein FU23548 3.6 
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409619 AKD01015 Hs.55220 BCL2-assoclated athanogene 2 3.6 

430044 AA464510 Hs.152812 ESTs 3.6 

430829 AW451999 Hs.194024 ESTs 3.6 

434224 AA380731 Hs.84 Inlerleukin 2 receptor, gamma (severe co 3.6 

5 439247 AF088020 Hs.46767 EST 3.6 

431542 H63010 Hs.5740 ESTs 3.5 

430713 AA351647 Hs.2642 eukaryofic translation elongation factor 3.5 

434988 A1418055 Hs.161160 ESTs 3.5 

437748 AF234882 Hs.5814 suppression of tomorigeniciry 7 3.5 

10 418322 AA284166 Hs.84113 cydln-dependent kinase Inhibitor 3 (COK 3.5 

439569 AW602166 Hs.222399 CEGP1 protein 3.5 
459583 A1907673 gb:IL-BT152-080399-004 BT152 Homo saplen3.5 

403212 NM J)19595:Homo sapiens Intersecfin 2 (IT 3.5 

409099 AKQ00725 Hs.50579 hypothetical protein RJ20718 3.5 

15 453968 AA847843 Hs.62711 Homo sapiens, clone IMAGE3351295, mRNA3.5 

436338 W92147 Hs.118394 ESTs 3.5 

422890 Z43784 Hs.75893 ankyrin3,nodeofRanv!er(ankyrinG) 3.5 

442295 A1827248 Hs.224398 Homo sapiens cDNA RJ1 1469 fis, clone HE 3.5 

417975 AA641836 Hs30Q85 hypothetical protein aJ23186 3.5 

20 433730 AK002135 Hs.3542 hypothefical protein FLJ1 1273 3.5 

439926 AW014875 Hs.137007 ESTs 3.5 

445873 AA250970 Hs.251946 pory(A)-blnding protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs.15830 hypothetical protein FU12691 3.4 
422128 AW881145 gb:QV0<)T0033*O10400-182-€07OT0033Homo 3.4 

25 414921 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.4 

441134 W29092 Hs.7678 cellular retlnotc acid-binding protein 1 3.4 

444564 AI167877 Hs.143716 ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

30 422414 AW875237 Hs.13701 ESTs 3.4 

433345 A1681545 Hs.152982 hypothetical protein FU131 17 3.4 

409213 U61412 Hs£1133 PTK6 protein tyrosine kinase 6 3.4 

422611 AA158177 Hs.1 18722 fucosyltrarjsferase8(alpha(1,6}rucosy 3.4 

423554 M90516 Hs.1674 glutamlne-fructose-6^sprtttetransamln 3.4 

35 442619 AA447492 Hs.20183 ESTs, Weakly similar to AF1647931 prote 3.4 
402359 NA Cl9001991^qi2656111(gb|AAK00751.1|AF23.4 

439398 AA284267 Hs.221504 ESTs 3.4 

415208 F01020 Hs.172004 tftin 3.4 

452853 AA812633 Hs.10845 ESTs 3.4 

40 429345 R11141 Hs.199695 hypothetical protein 3.4 

449027 AJ271216 Hs.22880 dipeptidyipeptidase III 3.4 

412115 AK001763 Hs.73239 hypothetical protein FU10901 3.4 

432180 Y18418 Hs.272822 RuvB (E cofi homologj-like 1 3.4 

428977 AKD01404 Hs.194698 cyclinB2 3.4 

45 431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4 
418286 AA622528 Hs.319825 Homo sapiens, clone IMAGE:3616574, mRNA.3.4 

436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3,4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunii, 3.4 

428450 NM014791HS.184339 KIAA0175 gene product 33 

50 449571 AW016812 Hs.200266 ESTs 3.3 

412777 A1335773 Hs.270123 ESTs 3.3 

420542 NNL000505HS.1321 coagulation factor XII (Hageman factor) 3.3 

412754 AW160375 Hs.74565 amyloid beta (A4) precursor-like protein 3.3 

418327 U70370 Hs.84136 paired-like homeodomaln transcription fa 3.3 

55 449065 A1627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3 

425999 AW513051 Hs.332981 ESTs, Weakly similar to 138022 hypotheti 3.3 

430280 AA361258 Hs.237868 Inlerleukin 7 receptor 3.3 

407777 AA161071 Hs.71465 squaleneepoxidase 3.3 

426516 BE262660 Hs.170197 gtotomboxaloacetic transaminase 2, mit 3.3 

60 • 414361 A1086138 Hs.204044 ESTs 33 

427080 AW068287 Hs.1 73466 ras-related C3 botullnum toxin substrate 3.3 

426429 X73114 Hs.169849 myosln-blnding protein C, stow-type 3.3 

446163 AA026880 Hs.25252 prolactin receptor 33 

428566 U41763 Hs.184916 clathrin. heavy potypeptkMke 1 3.3 

65 418641 BE243136 Hs.86947 a disintegrin and metaBoprotelnase doma 33 

436293 AI601188 Hs.120910 ESTs 33 
411257 AA628967 Hs.1 15274 ESTs, Highly similar to IHHJIUMAN INDIAN 33 
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430253 
430066 
436469 
437786 
444079 
457183 
431215 
424563 
450828 
408652 
445142 
426761 
439237 
422616 
443247 
406663 
434137 
408877 
439101 
408221 
447519 
404755 
451871 
420319 
430580 
400202 
400222 
425988 
458098 
430589 
431563 
442353 
422309 
419703 
420380 
410853 
454417 
432745 
422032 
415339 
426384 
448030 
418739 
442053 
434747 
427297 
412228 
452304 
453953 
407758 
451149 
430015 
433313 
418334 
450223 
454365 
451128 
417793 
428027 
441197 
424634 
419986 
416714 
449465 
422166 
409079 
423551 



AK001514 Hs236844 
A1929659 Hs.237825 
AK001455 Hs.5198 
BE142681 Hs.155573 
H09048 Hs.23606 
H91882 Hs.118569 
AA496078 Hs.121554 
AA446932 Hs.151428 
AW270655 Hs.1 93804 
R43409 Hs.6829 
AW978484 Hs.93842 
AI015709 Hs.172089 
AW408158 Hs.318893 
BE300330 Hs.1 18725 
BE614387 Hs.333893 
U24683 Hs.302063 
AA907734 Hs.124895 
AA479033 Hs.130315 
C01765 Hs.38750 
AA912183 Hs.47447 
U46258 Hs.339665 
NA 

AI821005 Hs.1 18599 
AW406289 Hs.96593 
AA806105 Hs.300697 
NA 
NA 

BE045897 Hs.274454 
BE550224 Hs.74170 
AJ002744 Hs.246315 
A1027643 Hs.120912 
BE379594 Hs.49136 
U79745 Hs.1 14924 
A1793257 Hs.128151 
AA640891 Hs.102406 



Hs.30469 
Hs.1 10826 



H04588 
AI244459 
AI821926 

AA476966 Hs.1 10857 
NMJ15156HsJ8398 
A1472078 Hs.303662 
N30714 Hs.325960 
AA310964 Hs.88012 
R35343 Hs.24968 
AA837085 Hs.220585 
AW292593 Hs.334907 
AW503785 Hs.73792 
AA025386 Hs.61311 
AW408337 Hs.36972 
D50915 Hs.38365 
AL047586 Hs.10283 
AW768399 Hs.11215? 
W20128 Hs296039 
AA319233 Hs.5521 
AA418204 Hs.241493 
AW966728 Hs.54642 
AL118668 

AW405434 Hs.82575 
U22029 Hs.334345 
BE244638 Hs.166 
NWL003613HS.151407 
AI345455 Hs.78915 
AF283770 Hs.79630 
NNL004380HS.23598 
W72424 Hs.1 12405 
W87707 Hs.82065 
AA327598 Hs.233785 



hypothetical protein FU10652 3.3 
signal recognition particle 72M) 3.3 
Down syndrome critical region gene 2 3.3 
polymerase (DNA directed), eta 3.3 
ESTs 3.3 
Dvl-binding protein IDAX (Inhibition of 3.3 
Human DNA sequence from done RP1 1 -218C13.3 
ret finger proteh 2 3.3 
ESTs 33 
Homo sapiens mRNA for K1M1644 protein, 3.3 
Homo sapiens cDNA: FU22554 fc, clone H 3.3 
Homo sapiens mRNA; cDNA DKFZp586l2022 (13.3 
ESTs. Weakty similar to A47582 B-cell gr 3.3 
selenophosphate synthetase 2 3.3 
c-Myc target JPOI 3.3 
Immunoglobulin heavy constant mu 3.3 
ESTs 3.3 
ESTs, Weakly similar to A47582 B-cell gr 3.3 
hypothetical protein FU11526 3.3 
ESTs 3.3 
ESTs 3.3 
Target Exon 3.3 
ESTs 32 
hypothetical protein 3.2 
immunoglobulin heavy constant gamma 3 (G 3.2 
NM_002795*:HGmo sapiens proteasome (pros32 
NM_002082*:Homo sapiens G protein-couple 3.2 
ESTs, Weakfy similar to 138022 hypotheti 3.2 
metaHothioneln1E(func8onal) 32 
UDP^-acetyl-a]phiO-galacto$amine:pc4yp 3.2 
ESTs 3.2 
ESTs. Moderately similar to ALU7JHUMAN A 3.2 
solute carrier family 16 (morwcarboxyOc 32 
ESTs 3.2 
ESTs 32 
ESTs 32 
trinucleotide repeat containing 9 3.2 
gbmt78f05J(5 NCLCGAP_Pr3 Homo sapiens 3.2 
polymerase (RNA) III (DNA directed) poly 3.2 
KIAA0071 protein 3.2 
ESTs 32 
membrane-spanning domains, subfamily A 32 
SHP2 interacting transmembrane adaptor 3.2 
Human DNA sequence from clone RP1-233G1632 
ESTs 32 
Homo sapiens, clone MGC:17333, mRNA. com3.2 
complement component (3d/Ep$tein Ban vi 3.2 
ESTs. Weakly similar to S10590 cysteine 32 
CD7 antigen (p41) 32 
KIAA0125 gene product 32 
RNA binding motif protein 8B 32 
ESTs 32 
ESTs 3.2 
ESTs 32 
natural killer-tumor recognition sequenc 32 
methionine adenosyltransferase It. beta 32 



gb:DKFZp761lQ310_j1 761 (synonym: hamy2)32 

small nuclear ribonucteoproteln polypept 32 

cytochrome P450, subfamily IIA (phenobar 32 

sterol regulatory element binding transc 3.2 

cartilage Intermediate layer protein, nu 32 

GA-binding protein transcription factor, 32 

C079A antigen {immunoglobunn-associated 32 

CREB binding protein {Rubinstein-Taybi s 32 

S1Q0 caldum-blndlng protein A9 (calgran 32 

interleukin 6 signal transducer (gp1 30, 32 

ESTs 32 
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453553 M036849 Hs.61829 Homo sapiens cDNA FU12763 fis, clone NT 3.2 

442580 AI733682 Hs.13Q239 ESTs 3.2 
458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 99232 

425700 AF076292 Hs.159251 forkhead box H1 3.2 

5 417124 BE122762 Hs.25338 ESTs 3.2 

407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2 

442215 M703172 Hs.129005 ESTs, Weakly similar to 2109260A B cell 3.1 

430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1 

425317 AW205118 Hs.210546 Interieukln 21 receptor 3.1 

10 426095 AI278023 Hs.89986 ESTs 3,1 

442313 BE388898 Hs.8215 hypothetical protein FU1 1307 3.1 

424709 AL137589 Hs.152149 hypotheflcal protein DKFZp434K0410 3.1 

429671 8E379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1 

432715 AA247152 Hs.200483 ESTs, WeaWy similar to KIAA1074 protein 3.1 

15 431574 AW572659 Hs.261373 hypothetical protein 0)434014.3 3.1 

436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1 

405017 NA Target Exon 3.1 

433805 AA706910 Hs.1 12742 ESTs 3.1 

437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1 

20 430105 X70297 Hs.2540 cholinergic receptor, nicotinic alpha p 3.1 

422083 NM.001141HS.111256 arachidonate 154ipoxygenase, second typ 3.1 

413507 BE145360 Hs.190064 ESTs, Weakly similar to 138022 hypolheti 3.1 

415989 A1267700 Hs.317584 ESTs 3.1 

422907 A1879263 Hs.6986 Human glucose transporter pseudogene 3.1 

25 425548 AA890023 Hs.1906 prolactin receptor 3.1 

422599 BE387202 Hs.116638 non-metastatic celts 1, protein (NM23A) 3.1 

439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1 

453883 AI638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1 

458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protBi 3.1 

30 418478 U38945 Hs.1174 cycilrwiependent kinase inhibitor 2A (me 3.1 

400B14 NA Target Exon 3.1 

402327 Target Exon 3.1 

416935 AA190712 gb:zp87f09.r1 Stratagene HeLa ceO s3 93 3.1 

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1 

35 437036 AI571514 Hs.133022 ESTs 3.1 

449523 NM_0Q0579Hs.54443 chernokine (C-C motif) receptor 5 3.1 

406642 AJ245210 gbiHomo sapiens mRNA for Immunoglobulin 3.1 

406624 AF052762 gb:Homo sapiens clone csneg8-1 immunoglo 3.1 

421924 BE514514 Hs.109606 coronki, actin-bindlng protein, 1A 3.1 

40 414523 AU076633 Hs.76353 serine (or cysteine) proteinase inhibito 3.1 

416379 N38857 Hs.203933 ESTs 3.1 

422823 089974 Hs.121102 vanin2 3.1 

433904 A1399956 Hs.208956 ESTs 3.1 

421904 BE143533 Hs.109309 hypothetical protein aJ20035 3.1 

45 428834 AW899713 Hs.339315 ESTs 3.1 

436043 AW963838 Hs.168830 Homo sapiens cDNA FLJ12136 fis, done MA 3.1 

452823 AB012124 Hs.30696 transcription factor-Ske 5 (basic helix 3.1 

405381 NA Target Exon 3.1 

428746 AW503820 Hs.192861 Spl-B transcription factor (Spi-1/PU. 1 r 3.1 

50 435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNADKFZp761C1712(f3.1 

425782 U66468 Hs.159525 cell growth regulatory with EF-handdoma 3.1 

423306 W88562 Hs.108198 ESTs 3.1 

419123 AA234276 Hs.88253 ESTs 3.1 

438581 AW977766 Hs.292133 ESTs, Moderately similar to 178885 serin 3.1 

5 5 417105 X60992 Hs.81226 CD6 antigen 3.0 

428361 NIVL015905HS.183858 transcriptionai intermediary factor 1 3.0 

417880 BE241595 Hs.82848 selectln L (lymphocyte adhesion molecule 3.0 

402606 NM_024626:Homo sapiens hypotheticat prot 3.0 

401451 NM_004496*:Homo sapiens hepatocyte nude 3.0 

60 421878 AA299652 Hs.111496 Homo sapiens cDNA FU1 1643 fis, done HE 3.0 

409518 BE384836 Hs.3454 KIAA1821 protein 3.0 

416933 BE561850 Hs.80506 small nudear ribonudeoprotein polypept 3.0 

414324 Y14768 Hs.890 lymphotoxln beta (TNF superfamfly, membe 3.0 

425081 X74794 Hs.154443 minichromosome maintenance deficient (S. 3.0 

65 401519 C15000476*:gil12737279lrefiXPJ)12163.1| 3.0 

411704 AM99220 Hs.71573 hypothetical protein RJ10074 3.0 

428819 AL135623 Hs.193914 WAA0575 gene product 3.0 
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428423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 3.0 
413835 AI272727 Hs.249163 fatty acid hydroxylase 3.0 
412600 128824 Hs.74101 spleen tyrosine kinase 3,0 
410491 AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 3,0 
433658 103678 Hs.156110 irnmunoglobuDn kappa constant 3.0 
427666 AI791495 Hs.180142 calmodulWike skm protein 3.0 
452514 A1904898 gfcRO0TO68-1 30399485 BT068 Homo sapien3.0 

429500 X78565 Hs2m 14 hexabrachlon (tenascin C, cytotactin) 3.0 
432485 N90856 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 3.0 
437400 AB011542 Hs.5599 EGF-like-domain, multiple 5 3.0 
452234 AW084176 Hs.223296 ESTs, Weakfy similar to 138022 hypotnetl 3.0 
413269 BE167526 gb:CM44fT050^080300-107-g07 HT0509 Homo 3.0 

453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 3.0 
400929 ENSP00000252232*:Sterol regulatory eleme 3,0 

445145 AI961702 Hs.147434 ESTs 3,0 
432615 AA557191 Hs.55028 ESTs, WeaWy similar to I54374 gene NF2 3.0 
423279 AW959861 Hs.290943 ESTs 3.0 
429392 AL109712 Hs396506 Homo sapiens mRNA full length Insert cDN 3.0 
408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC_HUMAN llll 3.0 
451346 Nft/LQ06338Hs.26312 glioma amplified on duomosome 1 protein 3.0 
413109 AW389845 Hs.110855 ESTs 3.0 
401714 NA ENSP00000241802^DNAFU11007FIS,CION 3.0 

421462 AF016495 Hs.104624 aquaporin9 3.0 
421750 AK000768 Hs.107872 hypothetical protein FU20761 3.0 
453293 AA382267 Hs.10653 ESTs 3.0 
457085 AA412446 Hs.98138 ESTs 3.0 
438930 AW843633 Hs.306163 hypothetical protein AL1 101 15 3.0 
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TABLE 22A 



10 



15 



20 



25 



30 



35 



Table 22A shows the accession numbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Ptey: Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

407980 103087 1 AA046309 AI263500 AA046397 

410785 1221055 1 AW803341 AW603265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

411743 1256098J AW862214AW859811 AW862215 

412138 1279172J AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538 

413269 1356961 1 BE167526 BE167651 BE076401 R24654 

416935 163179 T AA190712AA190665 AA252564 

422128 211994 1 AW881145AA490718 M85637AA304575 T06067AA331991 

423945 233566 1 AA410943 AW94S953 AA334202 AA332882 

424109 235506 1 AW406878AW966560AW966151 AW966496 AA336174 AA335376 AA335537 

424128 235728 1 AW966163 AA335983 AA336011 AA335668 AA335973 

425331 250199 J AW962128AA355353AA427363 

426878 273265.1 BE069341 AW748403 AL044891 A1908240 AA393080 

432745 353673.1 A!821926AA658826AA564492AA635129Ar791191 

441153 51084_2 BE562826 BE378727 

448212 755099J AM75858 AW969013 

451128 859865J AL1 18668 D78823 AI762176 

452514 920172J A1904898 AI904849 AI904899 

456207 165078.-1 AA193450 
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TABLE 22B 



Table 22B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers In this column are Genbank Identifier (GJ) numbers. "Dunham I. et aT refers to the publication 

entitled The DNA sequence of human chtanosome 22." Dunham I. et a!.. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which axons were predicted. 

NLposffion: indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400814 


8569925 


. Minus 


72840-72924,74761-74849 


400929 


7651921 


Minus 


122033-122241,123483-124028 


401045 


8117619 


Plus 


90044-90184,91111-91345 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Pius 


157315-157950 


401645 


7657839 


Minus 


3498645133 


401714 


6715702 


Plus 


96484-96681 


401866 


8018106 


Plus 


73126-73623 


402327 


7656695 


Minus 


108675-108770,109801-109910 


402359 


9211204 


Minus 


4040341961 


402408 


9796239 


Minus 


110326-110491 


402470 


9797107 


Plus 


195129-195776 


402542 


9801558 


Minus 


67076-67594 • 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


403011 


6693597 


Minus 


3468-3623 


403212 


7630897 


Minus 


156037-156210 


403329 


8516120 


Plus 


96450-96598 


403366 


8783692 


Minus 


49323-49652 


403485 


9966528 


Plus 


2886X3001,3198.3532,36554117 


404347 


9838195 


Plus 


74493-74829 


404580 


6539738 


Minus 


240588-241589 


404755 


7706327 


Minus 


53729-53846 


405017 


6532084 


Plus 


35551-35690 


405348 


2914717 


Minus 


4331043462 


405381 


6006920 


Minus 


7636-8054 


405801 


2924321 


Plus 


63469-63694 


405850 


6164995 


Plus 


13871-14110 


406153 


9929734 


Minus 


12902-13069 


406348 


9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

5 Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90 th percentile value amongst thel2 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 
10 (Le. 4-fold down-regulated in tumor vs. normal breast). 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

15 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of 50 th percentile normal body tissue to 75* percentile tumor 



20 Ptey BxAccn UnigenelD UnigeneTitle Ratio 

428722 U76456 Hs.190787 tissue Inhibitor of metailoprotelnase 4 22.4 

428848 NM.000230 Hs.194236 leptin (murine obesity homolog) 17.4 

445263 H57646 Hs.42586 KIAA1 560 protein 15.4 

25 418935 T28499 Hs.89485 carbonic anhydrase IV 15.0 

407228 M25079 Hs.155376 hemoglobin, beta 14.6 

417511 AL049176 Hs.82223 chordln-like 14.6 

439285 AL133916 Hs,172572 hypothetical protein FU20093 14.3 

412442 AI983730 Hs.26530 serum deprivation response (phosphatidyl 13.6 

30 410544 A1446543 Hs.95511 ESTs 12.6 

412047 AA934589 Hs.49696 ESTs 122 

422667 H25642 Hs.133471 ESTs 12.0 

406664 L34041 Hs.9739 glycerophosphate dehydrogenase 1 (so 12.0 

423201 NMJ00163 Hs.125180 growth hormone receptor 11.7 

35 422163 AF027208 Hs.1 12360 prominin (mouselike 1 10.8 

428769 AW207175 Hs.106771 ESTs 10.6 

407049 X72632 NM_021724 fc .Homo sapiens nuclear receptor 10.1 

412295 AW088826 Hs.117176 poly(A)-bInding protBin, nudear 1 9.8 

425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8 

40 406791 A1220684 Hs.272572 hemoglobin, alpha 2 9.5 

447471 AF039843 Hs.18676 sprouty (Drosophila) homolog 2 9.5 

451533 NMJ304657 Hs.26530 serum deprivation response (phosphatidyl 9.4 

419407 AW410377 Hs.41502 hypothetical protein FU21276 9.0 

411939 AI365585 Hs.146246 ESTs 9.0 

45 410532 T53088 Hs.155376 hemoglobin, beta 8.9 

425707 AF1 15402 Hs.11713 E74-Iike factor 5 (ets domain transcript 8.8 

416585 X54162 Hs.79386 teiomodin 1 (smooth muscle) 8.7 

443060 D78874 Hs.8944 procollagen C-endopeptidase enhancer 2 8.6 

435265 AA779958 Hs.185932 ESTs 8.5 

50 422511 AU076442 Hs.117938 collagen, type XVII, alpha 1 8.4 

433138 AB029496 Hs.59729 semaphorin sem2 8.3 

402195 NMJ004497*:Homo sapiens hepatocyte nude 8.1 

429350 AI754634 Hs.131987 ESTs 8.1 

445107 A1208121 Hs.147313 ESTs, Weakly similar to 138022 hypothett 8.1 

55 406643 N77976 Hs.272572 hemoglobin, alpha 2 8.0 

410199 AW377424 Hs.205126 Homo sapiens cDNA: FU22667 fis, clone H 8.0 

417225 AA815048 Hs.24078 hypothetical protein FU12649 7.8 

437569 AA760849 Hs.294052 ESTs 7.5 

436062 AK000027 Hs.98633 ESTs 7.5 

60 425078 NM_0Q2599 Hs.154437 phosphodiesterase 2A, cGMP-stimulated 7,5 

430327 AW973636 Hs.55931 ESTs 7.4 
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447577 AI393693 Hs,183297 DKFZP566F2124 protein 7.4 

446039 AI150491 Hs.90756 ESTs 7.2 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALUS^HUMAN A 11 

424455 AA452006 Hs.333199 ESTs 7.1 

5 424343 AW956360 Hs.4748 adenylate cyclase activating polypepflde 7.1 

442792 AI352340 Hs.131194 ESTs 7,0 

408714 A1219304 Hs.283108 hemoglobin, gamma G 6.9 

407571 AI446183 Hs.9572 ESTs, Highly similar to CYA5.HUMAN ADENY 6.8 

429580 AA346839 Hs.209100 DKFZP434C171 protein 6.7 

10 453500 A1478427 Hs.43125 esophageal cancer related gene 4 protein 6.7 

422233 AB002058 Hs. 113275 purinergic receptor P2X-I3(e 1 , orphan r 6.7 

420205 AA256395 Hs.88156 ESTs 6.6 

404368 NA ENSPO00OO241O75*:TRRAP PROTEIN. 6.6 

447261 NM_006691 Hs.17917 extraceMarCnkdonTam-containing 1 6.5 

15 417090 AA193282 Hs.85863 ESTs. Weakly simfiar to B34612 zinc fing 6.5 

410677 NNL003278 Hs.65424 tetranectin (plasrninogen-blndlng protein 6.5 

427283 AL119796 Hs.174185 ectonudeotidepyn^phataseiphosr^wdi 6.5 

415011 AW963085 gb£ST375158 MAGE resequences. MAGH Homo6.4 

412068 S72043 Hs.73133 metallothioneln 3 (growm Inhtoftory fac 6.4 

20 416253 BE250659 Hs.15463 Homo sapiens, done IMAGE2959994, mRNA 6.4 

435885 AA701483 Hs.36341 ESTs 6.3 

402779 NA Target Exon 6.3 

418138 AA213626 Hs.136204 EST 6.3 

439335 AA742697 Hs.62492 ESTs, Weakly similar to B39066 proline-r 6.3 

25 427019 AA001732 Hs.173233 hypotheticaj protein FU 10970 6.2 

411478 BE143068 gb:MR04fT01 58^330200-003-009 HT01 58 Homo6.2 

452654 BE004783 gb:MR2-8N01 14-270400-004-e1 1 BN01 14 Homo6.1 

447359 NMJ12093 Hs.18268 adenylate kinase 5 6.1 

414323 NM_014759 Hs.334688 WAA0273 gene product 6.1 

30 441266 H15968 Hs.293845 Homo sapiens, done IMAGE3502329, mRNA, 6.1 

417011 F08212 Hs.234898 ESTs. Weakly similar to 21 09260AB cell 6.0 

400089 NA Eos Control 6.0 

433614 W07475 Hs.277101 cytochrome c oxidase subunit IV teoform 5.9 

440439 N92818 Hs.64754 ESTs. Weakly similar to potential COS (H 5.9 

35 454404 BE067414 gb:MR4-BT0355-200100-201-e05BT0355 Homo 5.9 

436704 AA062610 Hs.148050 EST 5.9 

406563 NA Target Exon 5.9 

433490 AW451023 Hs.65848 hypothetical protein DKF2p7610132 5.9 

419313 AA843387 Hs.87279 ESTs 5.9 

40 409196 NNL001874 Hs.334873 carboxypeptldase M 5.8 

410882 AW809163 gb:MR4-ST01 18-261099-012^a03 ST01 18 Homo 5.8 

453469 AB014533 Hs.33010 KIAAD633 protein 5.8 

441899 AI372588 Hs.8022 TU3A protein 5.8 

426210 AA372052 Hs.334559 Homo sapiens cONA FU14458 Cs, done HE 5.8 

45 413065 BE063555 gb^M1*6T0283^81199-03S^J09BT0283 Homo 5.8 

454192 AW876813 Hs.3343 phosphogrycerate dehydrogenase 5.7 

425187 AW014486 Hs.22509 ESTs 5.7 

429757 AW452355 Hs.256037 ESTs 5.7 

420202 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5.7 

50 416284 AI695473 Hs.298006 ESTs 5.7 

428553 AA181641 Hs.184907 G protein-coupled receptor 1 5.6 

404689 NA Target Exon 5.6 

4388B7 R68857 Hs.265499 ESTs 5.6 

406082 S47633 Hs.82927 adenosine monophosphate deaminase 2 (tso 5.6 

55 449748 H23963 Hs.32043 ESTs 5.6 

431048 R50253 Hs.249129 cell death-inducing DFFA-iite effector a 5.5 

452205 C15819 gb:C15819 Clontech human aorta polyA mRN 5.5 

434040 AW444613 Hs.288809 hypothetical protein FU20159 5.5 

407744 AB020629 Hs.38095 ATP-binding cassette, sub-family A (ABC1 5.5 

60 450606 AI668605 Hs.60380 ESTs, Moderately similar to ALU6JWMAN A 5.5 

414629 AA345824 Hs.76688 carboxylesterase 1 (rrrarxjcyte/macrophage 5.5 

401665 C110007035i|10048448lreqNP^065258.1|g 5.5 

436107 T99079 Hs.191194 ESTs 5.5 

444432 AI161428 Hs.75916 splicing factor 3b, subunit 2, 145kD 5.5 

65 434715 BE005346 Hs.1 16410 ESTs 5.5 

447205 BE617015 Hs.11005 ESTs, Moderately similar to T17372 plasm 5.5 

408122 A1432652 Hs.42824 hypothetical protein FU10718 5.5 
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454016 AW016806 Hs.233108 
414913 R25621 

459033 AA017590 Hs.129907 

441003 BE172240 Hs.126379 

450637 N49826 Hs.18602 
442398 AA994520 
403612 NA 

407102 AA007629 Hs.9739 

410057 R66634 Hs.268107 

428232 BE272452 Hs.183109 

432769 AA620814 Hs.144959 

431344 R99530 Hs.272572 

427032 AF012023 Hs.173274 

406305 BE261320 Hs.158196 

437411 AW613948 Hs.194915 

442800 A1809481 Hs.131227 
402054 NA 

432085 AF212829 Hs.272406 

415313 R59638 Hs.6181 
459159 AI904646 

427164 AB037721 Hs.173871 

441391 BE467930 Hs.170381 

458959 AI285901 Hs.181297 
402698 NA 
401810 NA 

438879 AA827674 Hs.189073 

414657 AA424074 Hs.76780 

427809 M26380 Hs.180878 

456063 NNL006744 Hs.76461 

451186 AW023469 Hs.65256 

451882 AI821324 Hs.100445 
402583 NA 

431130 NMJJ06103 Hs.2719 

458218 AI435179 Hs.126820 

416083 R53467 . Hs.269122 
455282 BE143867 

426488 X03350 Hs.4 

426156 BE244537 Hs,167382 

407891 AA486620 Hs.41135 

408610 AW026692 Hs.224829 

445967 D59597 Hs.118821 

434813 AI524307 Hs.162870 

437526 AI076012 Hs.121388 
454775 BE160229 

409451 AF012626 Hs.54472 
409853 AW502327 
405062 

446490 AK000706 Hs.15125 

417622 AW298163 Hs.82318 

421978 AJ243662 Hs.110196 

440338 R62431 Hs.12758 

415421 R35009 Hs.24903 
417574 R00348 

409882 AJ243191 Hs.56874 

447998 At768289 Hs.304389 

445613 BE550889 Hs.158491 

443074 AW341470 Hs,144907 

451324 A1783600 Hs.208052 

432433 AW014734 Hs.157969 

449654 AI989812 Hs.199850 

414519 N94587 • Hs.55063 

457531 AW973716 Hs.13913 

433200 AA682722 Hs.192725 

430782 AF026263 Hs.247920 

427555 AW137094 Hs.97990 

433545 AA868510 Hs.112496 

420334 AI349351 Hs.118944 



ESTs 5.5 

gb:yh45fi)6.r1 Scares placenta Nb2HP Homo 5.4 

ESTs 5.4 

ESTs, Weakly similar to 138022 hypotheti 5.4 

ESTs 5.4 
gb:ou42g09.$1 Soares_NH w TJ3BC_S1 Homos5.4 

Target Exon 5.3 

glycerophosphate dehydrogenase 1 (so 5.3 

multimerin 5.3 

monoamine oxidase A 5.3 

ESTs 5.3 

hemoglobin, alpha 2 5.3 

integrin cytoplasmic domain-associated p 5.3 

transcriptional adaptor 3 (ADA3, yeast h 5.3 

ESTs 5.3 

ESTs 5.3 

Target Exon 5.3 

potassium channel, subfamily K. member 9 5.3 

ESTs 5.2 

gb:QV-BT065^)20399-103BT065Homosaplen 5.2 

K1AA1300 protein 52 

ESTs 5.2 

ESTs 5.2 

ENSP00000251335*:DJ1003JZ1 (sodium and 5.2 

Target Exon 52 

ESTs 52 

protein phosphatase 1, regulatory (inhb 5.2 

lipoprotein lipase 5.1 

relinoi-binding protein 4, interstitial 5.1 

ESTs, Weakly similar to leucine-rich gti 5.1 

ESTs 5.1 

NMJ)21620:Homo sapiens PR domain contain .5.1 

HE4; WFDC2; putafive ovarian carcinoma m 5.1 

ESTs 5.1 

ESTs, Weakly similar to ALUIJiyMAN AlU S 5.1 
gb:MR0-OT0164-0701G0«O13-h02 HT0164 Homo5.1 

alcohol dehydrogenase 16 (class I), beta 5.1 

natriuretic peptide receptor A/guanylate 5.1 

endomucin-2 5.0 

ESTs 5.0 

CGl-62 protein 5.0 

ESTs 5.0 
ESTs. WeaMy similar to MDHC_HUMAN MALAT 5.0 
gb:QV1-HT0413^)90200M)62-a12 HT0413 Homo 5.0 

fragile X mental retardation 2 5.0 

gb.-Ul-HF-BROp^aka^a-07-O-Ul.rl NIHJ/IGC_5 5.0 

Target Exon 5.0 

hypothetical protein FU20699 5.0 

WAS protein family, member 3 5.0 

N1CE-1 protein 5.0 

ESTs 5.0 

ESTs 5.0 

gb:ye69e06.r1 Scares fetal liver spleen 5.0 

heat shock 27kD protein family, member 7 5.0 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

KIAA1577 protein 4.9 

ESTs 4.9 

cholinergic receptor, muscarinic 5 4.8 

ESTs 4.8 

ESTs 4.8 

hypothetical protein FU22477 4.8 
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421795 
427138 
413072 
443721 
408053 
427067 
442969 
426220 
414593 
426893 
434046 
401590 
457971 
427722 
443793 
407737 
441955 
441499 
447517 
403017 
450580 
404611 
414831 
459290 
444341 
408614 
449638 
434416 
447360 
419583 
440698 
451199 
438338 
433756 
423301 
417237 
439745 
424137 
449338 
434744 
407402 
443510 
415754 
415986 
457416 
418064 
437120 
453950 
401093 
436935 
457974 
428222 
442705 
437409 
458494 
410490 
416069 
438463 
444043 
438327 
457711 
400870 
441425 
416267 
407262 
444567 
403263 



X63094 

N77624 

BE063965 

AW450451 

AW139474 

AA843716 

A1025499 

AI383475 

BE386764 

AA398716 

AW292618 

NA 

AW134679 

AK000123 

AA045290 

R49187 

AA972327 

AW298235 

AI382726 

N40087 

H58589 

M31158 

NM.001546 

AI142027 

AL137698 

AW204277 

AF134707 

AI375984 

F0O312 

A1348455 

A1290653 

NMJM861 

AW015933 

S67580 

H86385 

AL389981 

AA335769 

H73444 

N94835 

AF035303 

NMJM2190 

AA169114 

Z43619 

BE142052 

BE387287 

A1356125 

AA156998 

AW206494 

AW842353 

AL133112 

A1264634 

AL359599 

A1380906 

H03589 

R3710I 

AA807958 

AI499723 

H87407 

AF147401 

AA933590 

H45384 

M12873 

AV654020 

NA 



Hs^83822 
Hs.173717 

Hs266355 
Hs.246862 
Hs.177927 
Ks.132238 
Hs.171697 

Hs.97418 
Hs.113011 

Hs.242849 

Hs.160479 

Hs.25930 

Hs.6659 

Hs.142903 

Hs.101689 

Hs.182434 

Hs.15248 ' 

Hs.35156 

Hs.77439 

Hs.34853 

Hs.146650 

Hs.46531 

Hs.250723 

Hs.278679 

Hs.167216 

Hs.147492 

Hs.124758 

Hs.6168 

Hs. 11 2654 

Hs.1645 

Hs.81737 

Hs.149219 

Hs.16262 

Hs.394 

HSJ28382S 

H&9520 
Hs.12247 

Hs.62654 
Hs.83384 
Hs.157767 
Hs.211568 

Hs^53560 
Hs321717 
Hs.183085 
Hs.131127 
Hs.283850 
Hs.158436 

Hs.20982 
Hs.314232 
Hs.135089 
Hs.172944 
Hs.23917 

Hs.28937 



Hs.184261 



Rhesus blood group, 0 anfigen 4.8 

phosphate acid phosphatase type 2B 4.8 
gb:GV^0296-14020<H)85-h01 BT0296 Homo 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.7 

ESTs 4.7 

ESTs, WeaMy similar to T1 3924 sdk prate 4.7 
gb:601273249F1 NiH_MGC_20 Homo sapiens c 4.7 

ESTs 4.7 

ESTs 4.7 

Target Exon 4.7 

ESTs 4.7 

hypothetical protein RJ201 16 4.6 

ESTs, Weakfy similar to 2109260A B cell 4.6 

ESTs 4.6 " 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

Target Exon 4.6 

ESTs 4.6 

Homo sapiens cDNA FU1 1027 fis, clone PL 4.6 

protein kinase, cAMP-dependent, regukto 4.6 

inhibitor of DNA binding 4. dominant neg 4.6 

ESTs 4.6 
Homo sapiens mRNA; cDNA DKFZp434C1915 (f4.6 

hypofoetfcal protein MGC2747 4.6 

a disintegrln and metailoproteinase doma 4.6 

ESTs 4.6 

gb:HSBBOD101 STRATAGENE Human skeletal m4.6 

• Homo sapiens cDNA FU1 1777 fis, clone HE 4.6 

ESTs 4.6 

KIAA0703 gene product 4.6 

Homo sapiens, clone MG&9764, mRNA, comp 43 

cytochrome P450, subfamily 1VA, pdypept 43 

palmitoyl-protein thloesterose 2 4.5 

Homo sapiens mRNA full length insert cDN 4.5 

ESTs 4.5 

adrenomeduOin 4.5 

Homo sapiens genomic DNA, chromosome 21 q 4.5 

gbtHomo sapiens done 23943 mRNA sequenc 4.5 

fonnyitetrahydrofolate dehydrogenase 43 

hypothetical protein FU 1141 3 43 

gb:HSC1GE121 normalized Infant brain cON 43 

kringl&containing transmembrane protein 43 

S100 calciurn-bmding protein, beta (neur 4.4 
ESTs. Weakly similar to HXA2_HUMAN HOMEO 4.4 

eukaryotic translation initiation factor 4.4 

C12000586 # ."gi|6330167|dbiIBAA86477.1| (A 4.4 

ESTs 4.4 

ESTs, Weakly similar to S22765 heterogen 4.4 
Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4 

ESTs 4.4 
Homo sapiens mRNA; cDNA DKFZp547C126 (fr 4.4 

ESTs 4.4 

gb:yj42d08.r1 Scares placenta Nb2HP Homo 4.4 

ESTs 4.4 

ESTs 4.4 

ESTs 4.4 

chorionic gonadotropin, beta polypeptide 4.4 

ESTs 4.3 

C11000905:gi|11692565lgblAAG39879.1|AF28 4.3 

homeobox protein from AL390526 43 

gbyn99c10.r1 Scares adult brain N2b5HB5 4.3 

gbttuman Ig rearranged H-chain mRNA VDJ4 43 

ESTs, Weakly similar to T26686 hypomefi 4.3 

Target Exon 43 
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410034 BE067414 

456804 AI421645 Hs.139851 

448427 BE395260 Hs.309438 

416931 D45371 Hs.80485 

5 421296 NMJW2666 Hs.103253 
400973 NA 

452602 AW366194 Hs.55962 

412330 NM.0Q5100 Hs.788 
405016 

10 435104 AI475671 Hs.88607 
406118 

418556 T02850 

429745 AA480818 Hs221736 

433088 AW451206 Hs.115899 

15 444445 AA342329 Hs.115920 

453880 AI803166 Hs28462 

447384 A1377221 Hs.40528 

414541 BE293116 Hs,76392 

444975 AV652165 Hs.182482 

20 403921 NA 

451477 AI798425 Hs.42710 
406344 

416970 AA191201 Hs.35861 

413662 BE155865 H$25522 

25 458504 AW070634 Hs.144794 
404682 NA 

418089 N69913 Hs.6858 
403433 

446532 AW975460 Hs.143563 

30 414217 AI309298 Hs279898 

418425 AI871247 Hs.6262 

419589 AW973708 Hs201925 

457029 AA397789 Hs.161803 

447860 AF193807 Hs.131835 

35 448988 Y09763 Hs.22785 

440610 AI733098 Hs.130800 
439590 AF086410 

427240 AA399975 Hs274151 

408932 AW594172 Hs278513 

40 436112 T77545 Hs.187559 

444382 AJ144152 Hs.58246 

456716 AA318060 Hs.135121 

419846 NM.015977 Hs.285681 

410036 R57171 Hs.57975 

45 400545 NA 
403051 NA 

420139 NMJJ05357 Hs.95351 

450244 AA007534 Hs.125062 

453261 AA034116 Hs.118494 

50 440246 W52010 Hs.191379 

414516 AI307802 Hs.135560 

438232 AH50595 Hs.122226 

410233 AA082947 

412179 BE270758 Hs.69428 

55 441871 A1306150 Hs.153450 

426411 AK000708 Hs,169764 

453692 AL110416 

448640 AW817177 Hs.102558 

417481 AA203281 Hs21798 

60 412912 AW1 18878 Hs.110835 

454183 AW807116 

426328 AW631296 

435942 R06285 Hs.191215 

417629 T76945 
65 403593 NA 

402690 

418190 R49591 Hs270425 



gb:MR4-BT0355-200100-201-e05 BT0355 Homo 4.3 

caveolin 2 4.3 

EST 4.3 

adipose most abundant gene transcript 1 4.3 

perilipin 4.3 

ENSP00000236667*:Mucin 5B (Fragment). 4.3 

ESTs 4.3 

A kinase (PRKA) anchor protein (gravin) 4.3 

CY00017^i|9280405|gb|AAF864G2.1|AF245 4.3 

ESTs, Highly similar to F-box protein FB 4.3 

ENSP00000246632:CDNA FU20261 fis, done 4.3 

gb:FB12A9 Fetal brain, Stratagene Homo s 4,3 

ESTs 4.3 

ESTs 4.3 

Homo sapiens cDNA: FU22816 fts, clone K 4.3 

ESTs, Weakly stater to 138022 hypotheti 4.3 

ESTs 4.2 

aldehyde dehydrogenase 1 family, member 4.2 

ESTs, Weakly similar to T00362 hypoM 42 

C5000212*:gi|10047237jdbjJBAB13407.1| (A 4.2 

ESTs 4.2 

C5001660:gil11611537ldbjjBAB18935.1| (AB 42 

DKFZP586E1621 protein 42 

K1AA1808 protein 42 

ESTs 4.2 

C9001 188*:gi|12738842|refINPJ)73725.1| p 42 

ESTs. Weakly similarto I78885 serinerth 42 

NfyL001622:Homo sapiens alpha-2-H^glycop 42 

ESTs 4.2 

Homo sapiens cDNA: FU23165 lis, clone L 4.2 

hypothetical protein MGC8407 42 

Homo sapiens cDNA FU13446 fis, done PL 42 

ESTs 42 

Rhesus blood group, B glycoprotein 4.2 

gamma-aminobutync acid {GABA) A recepto 42 

ESTs 4.2 

gb:Homo salens fall length insert cDNA 4.2 

Cgatin 42 

TP53TG3 protein 42 

ESTs 42 

ESTs 42 

hypothetical protein FU22415 42 

Williams-Beuren syndrome chromosome regi 42 

calsequestrin 2 (cardiac muscle) 4.1 

Target Exon 4.1 

Target Exon 4.1 

lipase, hormone-sensifive 4.1 

ESTs 4.1 

ESTs 4.1 

ESTs 4.1 

ESTs, Weakly similar to T43458 hypotheti 4.1 

ESTs 4.1 

gb:zn10g07.s1 Stratagene hNT neuron (937 4.1 

hypothetical protein MGC3020 4.1 

ESTs. Weakly similarto 1909123A Na glue 4.1 

hypothetical protein RJ20701 4.1 

gb:DKFZp434KQ431_f1 434 (synonym: htes3) 4.1 

Homo sapiens, clone MGC:5352. mRNA, comp 4.1 

ESTs 4.1 

ESTs 4.1 
gb:MR4^T0062-04010OO244)12 ST0062 Homo 4,1 

gb:hh83c09.y1 NCLCGAPJ3U1 Homo sapiens 4.1 

ESTs 4.1 

gb:yc92c07.r1 Scares Infant brain 1NIB H 4.1 

Target Exon 4.0 

Target Exon 4.0 

ESTs 4.0 
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408641 AW245207 Hs.5555 hypothetical protein MGC5347 4.0 

427899 AA829286 Hs.332053 semm amyloid A1 4.0 

445975 AI811536 Hs.145734 ESTs 4.0 

438831 BE263273 Hs.6439 synapsinll 4.0 

5 455578 BE006350 Hs.14355 Homo sapiens cDNA RJ13207 fis, clone NT 4.0 

401840 NA TaigetExon 4.0 

413753 1)17780 Hs.75517 lamlnln, beta 3 (nicein {125kD) t kaMn 4.0 

445030 A1205925 Hs.147238 ESTs, Highly similar to AAC3.HIIMAN AtPHA 4.0 

433873 AW156913 Hs.150478 ESTs, Weakly smlar to A Chain A. toyst 4.0 

10 456736 AW248217 Hs.1619 achaete-scute complex (Drosophita) homo) 4.0 

450112 BE047734 Hs.5473 ESTs, Moderately similar to ALU5_HUMAN A 4.0 

448906 AI589567 Hs.309719 ESTs 4.0 
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TABLE 23A 
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Table 23A shows the accession numbers for those pkeys lacking unigenelD's for Table 23, 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession'* column. 



Pkey. Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



409853 
410034 
410233 
410490 
410882 
411478 
413055 
413072 
414593 
414913 
415011 
415985 
416267 
417574 
417629 
418556 
419583 
426328 
439590 
442398 
452205 
452654 
453692 
454183 



1156226 J 

1170594 J 

118656 J 

1205347J 

1225686J 

1247073J 

1347960.1 

1343163J 

1464909J 

1506721J 

151328J 

1564410.1 

1583547J 

1687770J 

1690392J 

1767866.-1 

186198J 

264901.1 

47413J 

541271J 

904 15 J 

925931J 

977825.1 

1049636J 



AW502327 AW502488 AW501829 AW502625 AW502687 

BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

AA082947AA083O36 

H03589 AW750687 AW750688 

AW809163 AW809247 AW809177 AW809190 AW809225 

BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848005 
BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 



454404 1170594 J 

454775 1234106J 

455282 1273020.1 

459159 919998.1 



BE386764BE387560 
R25621 C03959 C04010 
AW963085 AA159005 AW963073 
Z43619 R61274 H12206 R12883 
H45384 H49125 H41699 
R00348 R09593 
T76945 R20210R05755 
T02850 

F00312 AA247490 F31427 AA383663 F22045 

AW631296AA375464 

AF086410W94386 W74609 

AA994520AW393574 

C15819 AA024741 M024742 

BE004783 BE004947 AI911790 

AL110416AWB76759 

AW807116 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
BE141569AW807401 AW807310 BE141565 AW807318 AW807119AW807299 AW807241 AW807225 AW807204 AW807345 
AW807103 BE141615 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AWB07220 AW807399 
AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW807112 AW807378 AW807202 BE141593 AW807216 
AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
AW807110 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
AW807124 AW807419 AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW80731 1 AW807214 AW807132 AW807402 AW807350 
AW807028 AW807298 AW807291 AW807305 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW807198 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
AW800079AW807066AW807077AW845923AW807421 BE141619BE140943AW807420AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 8E141636 BE141630 AW807405 AW807290 AW807353 
BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE15291 1 

BE143867 AW935060 AW886684 

AI904646BE179494BE179421 
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TABLE 23B 



Table 23B shows the genomic positioning for those pkeys lacking unigene ED's and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey. Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Idenfifier (GO numbers. •Dunham L el aT refers to the publication 

entitled The ONA sequence of human chromosome 22 * Dunham I. et at, Nature (1999) 402:439-495. 
Strand: Indicates DNA strand from which axons were predicted. 

Ntposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand NLposiu'on 


400545 


9800107 


Minus 


124618-124881 


400870 


9838306 


Minus 


34081-35027 


400973 


7960452 


Minus 


98119-98253 


401093 


8516137 


Minus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Pius 


121591-122537 


401810 


7342191 


Pius 


129063-129476 


401840 


7684597 


Plus 


5628346439 


402054 


8083691 


Minus 


8288-8806 


402195 


7689778 


Minus 


147901-148884 


402583 


7684486 


Pius 


94883-95003 


402690 


8346058 


Plus 


13368-13998 


402698 


8570304 


Minus 


108641-108903 


402779 


9588555 


Minus 


38173-39210 


403017 


6693623 


Pius 


78630-79367 


403051 


4827080 


Minus 


52694411 


403263 


7770677 


Plus 


5243142737 


403433 


9719611 


Minus 


72225-72437 


403593 


6862650 


Minus 


6255442712,6944949602 


403612 


8469060 


Minus 


94723-94859 


403921 


7711590 


Minus 


3297-3536 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


40977-41150 


404689 


7534100 


Plus 


119461-119717 


405016 


6524300 


Plus 


5199743308 


405062 


7657730 


Pius 


101283-101432 


406118 


9143818 


Plus 


5399744629 


406344 


9255974 


Plus 


20254-20374^0526-20659^0835-21097 


406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, Pred.Cell.Loc, and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 
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Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

UnigeneTitle: Unigene gene title 

Pred.Cell.Loc: Predicted Cellular localization 



Seq.ID.No.: 



Sequence Identification Number found in Table 25 



Pkey ExAccn UnigenelD UnigeneTitle 



PredXetiLoc. Seq. ID. No. 



449746 
407276 
415539 
400297 
450375 
102457 
429170 
424399 
422505 
449765 
425692 
426215 
439340 
410102 
429220 
416276 
409079 
442818 
442082 
444381 
446163 
416636 
442117 
433043 
429353 
452190 
446733 
452747 
423242 
417433 
432201 
423961 



AI668594 Hs.176588 
AI951118 Hs.326736 
AI733881 Hs.72472 
A1127076 Hs.334473 
AA009647 Hs.8850 
NM.001394H1L2359 
NMJ01394HS.2359 
A1905687 Hs.2533 
AL120862 Hs.124165 
N92293 Hs.206832 
D90041 Hs.155956 
AW963419 Hs.155223 
AW449211 Hs.105445 
AW248508 Hs.279727 
AW207206 Hs.136319 
U41060 Hs.79136 



W87707 

AK001741 

R41823 



Hs.82065 

Hs.8739 

Hs.7413 



BE387335 Hs.283713 
AA026880 Hs.25252 
N32536 Hs.42645 
AW664964 Hs.128899 
W57554 Hs.125019 



AL117406 
H26735 



Hs.200102 
Hs.91668 



114480 
404561 
325372 
112287 
335824 
424735 
400289 
427585 
429925 
429441 
421155 
420931 
420813 
452744 



AA863360 Hs.26040 

8E153855 Hs.61460 

AL039402 Hs.125783 

BE270266 Hs.82128 

A1538613 Hs.298241 

D13666 Hs.136348 

AW602166 Hs.222399 

BE066778 Hs.151678 

NA 

AB033064 Hs.334806 
NA 

U31875 Hs.272499 

X07820 Hs.2258 

D31152 Hs.179729 
Nf^000786Hs^26213 

AJ224172 HS204096 

H87879 Hs.102267 

AF044197 Hs.100431 

X51501 Hs.99949 

AI267652 Hs.30504 



ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

hypothetical protein DKFZp56401278 

a disintegrin and metaUoproteinase doma 

dual specificity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar to ALU8JWMAN A 

N-acetyltransferase 1 (ary lamina N-acety 

stanniocaldn 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FU14035 lis, clone HE 

ESTs 

U\M protein, estrogen regulated 
interleukin 6 signal transducer (gp130, 
hypothetical protein FLJ10879 
ESTs 

ESTs, Weakly similar to S64054 hypothetl 
Homo sapiens cDNA FU 13603 fis, done PL 
solute carrier family 16 (rrwnocarboxyDc 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakly similar to fatty add omega 
Ig superfamfly receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophoblast glycoprotein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fascicfin 
CEGP1 protein 

UDP^-acety^afpha-D-gaiactosaminapolyp 
NMJ14112*:Homo sapiens trichorhlnophala mitochodria 
Phase2&3Exons nuclear 
KIAA1238 protein 

ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
short-chain alcohol dehydrogenase family 
matrix metalloproteinase 10 (stromelysln 
collagen, type X, alpha 1 (SchmkJ metaph 
cytochrome P450, 51 (lanosterol 14-alpha ER 
Bpophilin B (uteroglobin family member) 

extracellular 



nuclear 



small inducible cytokine B subfamily (Cy 
prolactin-induced protein 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 

343 



SeqlD142 
SeqlD3&4 
SeqlD5&6 
SeqID7&8 
SeqlD9&10 
SeqlD11&12 
Seq ID 1U 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19&20 
Seq ID 21& 22 
Seq ID 23424 
Seq ID 25426 
Seq ID 27428 
Seq ID 29 & 30 
Seq 1D31 &32 
Seq ID 33434 
SeqlD35&36 
Seq ID 37438 
Seq ID 39 & 40 
Seq ID 41 442 
SeqtD43&44 
Seq ID 45 446 
Seq ID 47 448 
SeqlD49&50 
Seq ID 51452 
Seq ID 53454 
Seq ID 55456 
SeqlD57&58 
Seq ID 59 460 
SeqlD61&62 
Seq!D63&64 
Seq ID 65 466 
Seq iO 67 & 68 
SeqlD69&70 
Seq ID 71 472 
Seq ID 73474 
Seq ID 75476 
Seq ID 77478 
Seq ID 79 4 80 
Seq ID 81 4 82 
Seq ID 83484 
Seq ID 85486 
Seq ID 87488 
Seq ID 89490 
Seq ID 914 92 
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420757 
424905 
429659 
446921 
445537 
428227 
424001 
421727 
452838 
419667 
414812 
426320 
422867 
411558 
417866 



431958 
428722 
412970 
421379 
415752 
444051 
451110 



X78592 Hs.99915 
NKL002497HS.153704 
NMJW7Q5GKS.225952 
AB012113 Hs.16530 
AJ245671 Hs.12844 
AA321649 Hs.2248 
W67883 Hs.137476 
V13153 Hs.107318 
U65011 Hs.30743 
AU077005 Hs.92208 
X72755 Hs.77367 
W47595 Hs.169300 
L32137 Hs.1584 
AA102670 Hs.70725 
AW067903 Hs.82772 
AI24936B Hs.98558 
X63629 Hs.2877 
U76456 Ha.190787 
AB026436 Hs. 177534 
Y15221 Hs.103982 
BE314524 Hs.78776 
N48373 Hs.10247 
AI955040 Hs.265398 



androgen receptor (dihydrotestosterona r 
N1MA (never in mitosis gene a)-related k 
protein tyrosine phosphatase, receptor t 
small inducible cytokine subfamily A (Cy 
EGF-like-domaln, multiple 6 
smaQ inducible cytokine subfamily B (Cy 
paternally expressed 10 
kynurenine 3-monooxygenase (kynurenine 3 
preferentially expressed antigen in mela 
a disintegrin and metaHoproteinase doma 
monokine induced by gamma interferon 
transforming growth factor, beta 2 
cartilage oikjomeric matrix protein (pse 
gamma-aminooutyric acid (GABA) A recepto 
collagen, type XI, alpha 1 
ESTs 

cadherin 3, type 1, P-cadherin (placenta 
tissue inhibitor of metaUoprotelrtase 4 
dual specificity phosphatase 10 
small Inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformation* 



cytoplasm 


SeqlD93&94 


nuctear 


SeqlD95&96 




Seql097&98 


extracellular 


SeqiO99&100 




Seq ID 101 & 102 
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TABLE 24A 



Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
5 were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

1 0 Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 

15 Pkey CAT number Accession 

335824 CH22_3197FGJ19JUINI1_E 
325372 c12Lhs 



345 



# * 



WO 02/059377 PCT/US02/02242 



TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

1 0 Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al" refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. etal, Nature (1999) 402:469495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



15 



20 



Pkey Ref Strand NLposition 
404561 9795980 Minus 69039-70100 
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10 
15 



45 



65 



Table 25 

The 69 gene sequences identified to be overexpressed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs in 
SZS^S^nmy be used to either identify genes that encode known proteins, or they may be used to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:516-522) 

Seq ID NO: I DNA sequence 

Nucleic Acid Accession*: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATjGGAGCCCT CCTGGCTTCA GGAACTCATG GCTCAOCCCT TCTTGCTGCT GATCCTCCTC 60 
-rT^A-m-mrr- xr.rTfirTnTT TCAnnTA ATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 



ATGGAGCCCT CCTGGCTTCA GuAACTCATu OUUUA,v i luiuviuuiurti^^.v w 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 180 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTG ATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGG ACCCTT TACG ATGTTC TTCAGTGTCC ATGACCCAG A CTATGCCAAG 300 
ATTCTCCTGA AAAGACAAGA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TG AATCCTGG 360 
GTTGGTCGAG G ACTTGTG AC CCTGG ATGGT TCTAAATGGA AAAAGCACCG CCAG ATTGTG 420 
20 AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TG AG AGTGTT 480 
CGGATG ATGC TG AACAAATG GGAGG AACGC ATTGCCCAAA ACTCACGTCT GG AGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATGAAGT GTGCCTTCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
25 TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GG AAGG AGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGG ACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTGA AGTG AAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATC A GCAGAGATGC 1020 
30 CGAG ATGAA A TCAGGGAACT CCTAGGGG AT GGGTCTTCTA TTACCTGGGA ACACCTG AGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1140 
AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTCCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAAAATT CTGAAAAAAT ACATCCCTAT 1320 
3 5 GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AG AATGGAAT CCATGTGTTT 1500 
GCAAAAAAAG TTTGCTA^ATT TTAAGTCCTT TCGTATAAGA ATTAATOAG A CAATTTTCCT 1560 
ACCAAAGGAA GAACAAAAGG ATAAATATAA TACAAAATAT ATGTATATGG ™^Jf*J™ 
40 AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
TCTTTGCTGT ATCTGGTGAA ACCCACAAAA ACACCTG AAA AAACTCAAGC TGACTTCCAC 1740 
TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAG AG TGGCTTTCAA GCATAGTTTG 1800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTCAG TTATCTTTCC CC AATAATAA AAAA 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 



50 1 11 2! 31 41 51 

MEPSWLQELM AHPFLLULL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHK1LESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNISILK IFITMMSESV RMMLNKWEER IAQNSRLELF 180 

55 OHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SQGQDFSKFN QELHQFTEKV IQDRKESLKD KUCQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLQAEVKT FMFAGHDTTS SA1SWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 
OMPYTTMCIK ECLRLYAPW N1SRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAH ECKVAVALTL LRFKLAPDHS 480 

60 RPPQPVRQW UCSKNGIHVF AKKVC 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM 052997 

Coding sequence: 100-4 1 25 (underlined sequences correspond to start and stop codons) 



1 II 21 31 41 51 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
70 AAAGCTGCCT CCCGGGG ACA AGTCCGGAAG CTGGAGAAG ATG ACAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAGA CGCCCAGAAG AGGACTGCTC TACACTGGGC CTGTGTCAAT 180 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA G ACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCGAACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CG ATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
75 CATTATGCTG TTTATAGTG A G ATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCG AAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AACGAAAAGA 480 
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AGTGAGCAAA TTGTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGGAT CATCAGAG AT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTG A CGTCTTTGCT GCAGATATAT GTGGAGTAAC TGCAG AACAT 660 
TATGCTGTTA CTTGTGG ATT TCATCACATT CATGAACAAA TTATGGAATA TATACGAAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA GAAGG AACAT CTGCAGGAAC ACCTGATG AO 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC ACAGCTGAAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGG AG GG AACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAQT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC 1080 
GCATGGGAG A AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
G AGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA GAAAAAAGAA 1200 
ACA(XTGTAA AG ACTGGATG CGTGGCAAG A GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAGATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTC TCTTTGAGAG TTCTGCAAAG ATTCAAGTOT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATGAA 1 560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAG A GTCTCCTAAT 1 740 
AAAGATGGTC TTCTGAAGGC TACCTGCGGA ATG AAAGTTT CTATTCCAAC TAAAGCCTTA 1800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG GAGCCTCCGG GGAAGCCATC TGCCTTCGAG I860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 
ACATGGAGAG CAGATGAGAT ACTCCCATCA G AATCCAAAC AAAAGG ACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAG AGTCT CTGTG AGACT GTTTCACAGA AGG ATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAA AGAAAT AGATAAAATA AATGG AAAAT TAGAAGGGTC TCCTGTTA A A 2 1 00 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTG AGAGCAG ATGAG ATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAG A TAAAATAAAT GGAAAATTAG AAGAGTCTCC TOATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 
ATGGACATGC AA ACTTTCAA AGCAG AGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGGAAT TG AAGAATG A ACAAACATTO 2640 
AGAGCAGATC AGATGTTCCC TTCAG AATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTG A GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGGATA CAGTTCATTC TTGTG AAAGA GCAAGGGAAC TTCAAAAAG A TCACTGTGAA 2880 
CAACGTACAG G AAAAATGG A ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAA ATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTG AC TTTAA ACCAA GAAGAAGAGA AGAGAAG AAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAG AATT GAAGAGTGTA 3180 
G AAAGTAATT TG AATCAGGT TTCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TGAAAAAGG A AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAG AAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTGATAGCT GAGAACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAG AGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTACGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGAAGCTC AAAGGAAATC CAAAAGCCTA 3720 
AAAATTAATC TCAATTATGC AGGAG ATGCT CTAAG AGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTG A AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACG AACAAG ATAATGTG AA CAAACACACT GAACAGCAGG AOTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CAACAGCA AT TAGTTC ATGC ACATAAGAAA 3960 
GCTGACAACA AAAGCAAGAT AACAATTGAT ATTCATTTTC TTG AGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAGAGAA AAA TG AGGAGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CAJGAGAG AC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGOCCA 4200 
GTCCTAGCAT CACCTTATGT TG AAAATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 
GAATTACTTG TTCACGAATT GCATAAAGCT GCACAGG ATT CCCATCTACC CTGATGATGC 4380 
AGCAG AC ATC ATTCAATCCA ACCAG AATCT CGCTCTGCAC TCCAGCCTAG GTG ACAGAGT 4440 
GAGACTCCAC CTCGGAAA 



Seq ID NO: 4 ffotein sequence; 
Protein Accession* NP 443723.I 

1 11 21 31 41 51 
I I I I I I 

MTKRKKTINL N1QDAQKRTA LHWACVNGHE EWTFLVDRK CQLDVLDGBH RTPLMKALQC 60 
HQEACAN1U DSQADINLVD VYGNMALHYA VYSEILSWA KLLSHGAVIE VHNKASLTPL 120 
LLSITKRSEQ IVEFLLIKNA NANAVNKYKC TALMLAVCHG SSEIVGMLLQ QNVDVFAADl 180 
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CGVTAEHYAV TCGFHHIHEQ IMEYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
5 DEEYSCDSRS LFESSAKIQV CIPESIYQJCV MEINREVEEP PKKPSAFXPA IEMQNSVPNK 480 

AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKING 540 
KLEESPNKDG LLKATCGMKV SIPTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKMEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKE1DKINGK 660 
LEGSPVKDGL LKAKCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEP A1E MQKSVPNKAL 720 

10 EUCNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK. DVCLPKATHQ KEIDKJNGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK. PSAFEPAIEM QKSVPNKALE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQFC EMDKISGKLE 900 
DSTSLSKJLD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEUCSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNADILNE KIREELGREE EQHRKELEVK QQLEQALRIQ 1020 

1 5 DIELKSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL EIATLKHQYQ EKENKYFEDI 1080 
K2LKEKNAEL QMTUOXEES LTKRASQYSG QLKVUAENT MLTSKLKEKQ DKEILEAEIE 1 140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSLKIN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSKJTIDIHF LERKMQHHLL KEKNEEIFNY 1320 

20 NNHLKNRIYQ YEKEKAETEN S 



Seq 10 NO: 5 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop codons) 



I 11 21 31 41 51 
I I I I I 



CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCX3CGGCGCT 60 
30 GAGGACGCGG GAGCCGGGAG CGCACGCGCX) GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGG AAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGG AAAATTA 300 
AATGTGGGCA CCAAGAAAG A GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
35 TTGCGTTGTA A ATGCCACC A CC ATTGTCCA G AAGACTCAG TCAACAATAT TTGC AGCAC A 420 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 
GGTTGCCTAG G ACTAGAAGG CTCAG ATTTT CAGTGTCGGG ACACTCXXAT TCCTCATCAA 540 
AGAAG ATCAA TTGAATGCTG CACAGAAAGG AACGAATGT A ATAAAGACCT A CACC CTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 
40 ATATCTGTGA CTGTCTGTAG TTTGCtCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTG AGCAGT CTCAGAGCTC AGGAAGTGG A 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTG AAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
45 GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAG A AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 140 
TATCTG AAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
50 CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTG AACAGA 1440 
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTC AT CCTTTGGGAG 1 500 
GTTGGTAGGA G ATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
55 CTAGTGCCCA GTGACCCCTC TTATOAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAA ACCG GTGGAGCAGT GATOAGTGTC TAAGGCAGAT GGGAAAACTC 1680 
ATG ACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
60 TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAG ACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGG AG AAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 

65 Sag ID NO: 6 Protein sequence: 

Protein Accession #: none found 

1 11 21 31 41 51 

^ • MLLRSAGKLN VGTKJCEDGES TAPTPRPiCVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 
DSGLPWTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDF VD 120 
GPIHHRALLI SVTVCSLLLV UILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180 
EQSQSSGSGS GLPLLVQRTI AKQ1QMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 

75 WFRETEIYQT VLMRHENILG FIAADDCGTG SWTQLYUTD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSS VS GLCHLHTEIF STQGKPA1AH RDLKSKNTLV KKNGTCCIAD LGLAVKFISD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGULWEV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession U: none found 

Coding sequence: 482-3007(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I 1 I I I i 

AACTQAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA G AAAGCAAGA GTGATCAATC TCAGCTCTGT TA AACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180 
ATGCCGTG AA TTTTAATTG A GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 
CTTTATGAAG CTATGGGACT TG AC AAA AAG TGATATTTGA G AAG AAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA OAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTG AAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCACG ACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTG AAACAAC TTCATATCAA 840 
TCACAATTCT TTAG AAATTC TTAAAG AGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 
CCTGCAAGCA G ATAACAATT TTATCACAGT G ATTG AACCA AGTGCCnTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATG ACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 
TGTTGGTTTT CTCG AACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG i 140 
GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAG AAG G AATCTATTT GCCCTACTCC ACCAGTGTAT GAAG AACATG AGG ATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 
ATCCACTCA A CTTCCAGG AC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1 500 
ATCAGGACTT CTAATACATT OTCAGGAOCG CAACATTGAA AGCTTATCAO ATCTGAQACC 1560 
TCCTCCGCAA AATCCTAG AA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 
GTCTGATCTA GTGG AATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTO A 1 680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 
TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTG AA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG G ATGATCTTG ATTTACTAAC CCAG ATTOAC CTTGAGGATA ACCCCTGGGA 2040 
CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG G CTG AT ACT A TTTTACGATC 2280 
TCTTACGG AC GCTGTGCCAC TGTCTGTTCT AATATTGOG A CTTCTGATTA TGTTCATCAC 2340 
TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGG A G AAGATACAA 2400 
AAAG AAACAA GTAG ATGAGC AAATG AG AG A CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TG AAAGACCC TCTGCCTCAC TCTATG AACA 2S20 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTG AT GCAAAACATC TCCAAAGAAG 2640 
TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTC A AATATG AAAT ACAAAACCAC 2700 
GAACCAATCA ACAG AATTTT TATCCTTCCA AG ATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGG AGCC CACG AAGAGC TG AAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATG AGTA 2940 
TTTTGAACTT AAAGCTAATT TACATGCTG A ACCTGACTAT TTAGAAGTCC TGG AGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TG ATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 



Scq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

1 It 21 31 41 51 
I I I I I I 

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMUNCEA KGIKMVSEIS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 
HNSLEIUCED TFHGLENLEF LQADNNF1TV IEPSAFSKLN RLKVULNDN A1ESLPPNIF 180 
RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 
HGDWCNSP PFFKGSILSR LKKES1CPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSIUCLPTKA PGUPYITKP STQLPGPYCP IPCNCKVLSP SGLUHCQER. NIESLSDLRP 360 
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PPQNPRKUL AGNUHSLMK SDLVEYFTLE MLHLGNNRJE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNABCEl LPGTFNPMPK LKVLYLNNNL LQVLPPKtFS 480 
GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 
TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 
5 LTDAVPLSVL ILGLUMFIT IVFCAAGIW LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 
YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRN1L EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 



1 11 il 31 41 51 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 

20 CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGOCGGC GCGGGCCGTG CGCG AGGGCT 1 80 
CCGGAGCTGA CTCGCCGAGG CAGG AAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CG AAGGAAGA 420 

25 GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGG A AAGCAAAGAA 540 
CTGATCATAA ATCTGG AAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGG ACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 

30 TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AG AGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

3 5 CAGCG ATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAG ACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1 140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

40 GACAATCCCC TTG GTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT COCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAOTCTT TCGGGGGCCA OAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 

45 GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1 740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1 800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 
GGCTACTGCT ACAATGGCAT CTGCCAG ACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 

5 0 CCAGGTGCTA A ACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GG ATTCTGTG CCGGGGGACC 2160 
CACOTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

55 GCAGATGGAA AAATCTGOCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAG AA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG G AATTCTGGT G ACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACG ACTGCTG 2520 

60 TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
OGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAG ATT CCTACCCACC GAAGGACAAT CCCAGGAG AT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA G ACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAG ACC CCTGCCAGCC 2820 

65 AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAG A CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCA CACCAA 3000 
GTGCCCAG AT CCACCCACAC CGCCTATATT AAGTGAOA AG CCGACA CCTT TTT TCAACA G 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 

70 GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3 180 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGT AAATT ATTAAT TTAT 3240 
GCAGAATGTT G ATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 

7 5 CCA ACTACCC CCAGCTGTGC TTATGGTACC AG ATGCAGCT CAAGAG ATCC CAAGTAG AAT 3480 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGG AG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NMJ003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 
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ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AG AGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTG AG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAOTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGOCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
CCAGTAGGTT A7TTAGCTTG GGAAAGGTGG TOTTTCTGTA AOAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CX^CCTCCCT ATACTGCTTG G AGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAOATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGG A GCCATGTTTC ACAAGGTCTT TAAAGTG ACT 4440 
AATGGC ATG A G AAATACAAA AATACTCAG A TAAGGTAAAA TGCCATG ATG OCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAG A AAACAAAACT GCATTTCACT TTCCOGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATG ACT ATG G ATAAACTAAA CACGTGACAC 4740 
AAACACACAC AAAAGGG AAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAG AATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: HP 003465.2 

1 11 21 31 41 51 
1 I I I 1 I 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWEPVKSFD 60 
SKNHPEVLNI RLQRESKEU INLERNEGLI ASSFTETHYL QDOTDVSL AR NYTVILGHCY 120 
YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKS VRGSCG 1 80 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
UEIANHVDK FYRPLNIR1V LVGVEVWNDM DKCS VSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTOASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
ffiTNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 
LAAGF WYLK RKTURLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

Scq ID NO; 1 1 DNA sequence 

Nucleic Acid Accession #: NM_O01394 

Coding sequence: 400-1 584(underluied sequences correspond to start and stop codons) 
1 U 21 31 41 51 

1(111! 

GGCACGAGOG CTGAGCGCCG GAOGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 1 80 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCG AO GCTGCGGGAG GAAGGGGACA 240 
GGGAAG AAG A GGCTCTCCCG CGGG AGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCT TC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCGAC CA TG GTQACGAT GGAGGAGCTG 420 
CGGGAG ATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480 
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 
CTGG ACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGG A GCAGATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACG AGC GCAGCCCGCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 
GAOAGGTTTT CCTCCGAGTA CCCAG AATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACQ ACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1 140 
GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TO AG G CTGG A GGAGGCCTTC 1320 
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G AGTTCOTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380 
CTGCAGTTCG AGTCCCAGGT GCTGGCCACO TCCTGTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GGACCCCTGC GGGAGCGGGG CAAGACCCCC GCCACCCCCA CCTCGCAGTT CGTCTTCAGC 1500 
TTTCCGGTCT COGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
5 ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620 
CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CGAGGCCAGC CAGAATGGCA 1740 
ATAAGG ACTC CG AATACATA ATAAAAGCAA ACAGAACACT CCAACTTAGA GCAATAACCG 1800 
GTGCCGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1 860 

10 AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1920 

CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG 1980 
AAG ACAAGTT TCCCAG AAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTTTTCATTTCTCAGTC 2100 
ACCAACAG AT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 

1 5 TGAAAGTACA AAAAGTCTAG TTACAGTGG A TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGG AA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC A CTGAGA AGC AGGCGCGTTG 2340 
GGTGGG AGGA GGAAGAAAGG GAAGAATTAG GTTTG AATTG CTTTTTAAAA AAAAAAGAAA 2400 
AGAAAAAG AC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAA GCAG GCGGGTTGGG 2460 

20 TGGG AGGAGG AAGAAAGGGA AG AATTAGGT TTGAATTGCT TTTTTAAAAA AAAA 

Seq ID NO: 12 Protein sequence: 
Protein Accession*: NPJ00138S 

25 1 11 21 31 41 51 

MVTMEELREM DCSVLWILMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQ1LPAE EEVRARLRSG LYSAVTVYDE RSPRAESLRE 120 
DSTVSLWQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 1 80 
30 LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240 
QYKCIPVEDN HKADISSWFM EA1EYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 
VRLEEAFEFV K.QRRSUSPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

35 Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession »: none found 

Coding sequence: 68-340(iindcr1ined sequences correspond to start and stop codons) 



40 1 11 21 31 41 51 

AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 

45 TGATG AAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 
TACCACTGCA ACCACOGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACG ATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATG A 480 

50 GCGAGCTAAC AT 
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Seq ID NO: 14 Protein sequence; 
Protein Accession #: none found 



1 11 21 31 41 51 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGP ADDE APDAETTAAA TTATTAAPTT 60 
60 ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM 01 6640.2 

Coding sequence: 39-1 358(underiined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

GCTTAAGTTG ACCTCTGGGT CCGG AATCGC GGGCAAAG AT GGCGGCGGCC AOGTGTTGGA 60 
70 GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 180 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240 
CG ACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
75 ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCO 420 
AGCCCGAAOC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCG AGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
CAGAGTTTGT GCCATTGG AT TATTCTGTTC CTATAGAAAT CCCCACTATA AA ATGTAAAC 840 
CAGACAA ACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGGAAAGGCT TTTGAG ACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 
AAGCAG ATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1 1 40 
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTICAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAG A AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380 
TGTGGGAATA TTTAAATTTT ACTGAAGG AA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AG ACAAATAT TTCTTATGTC AACCTGTTAT TAGATCTCTT 1 500 
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT ACCTTTTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTG AA TTCAGTAATT 1680 
AAAAGAACCC ACTGC 



SeqlDNO 16 Protein sequence: 
Protein Accession #: NP_057724. 1 

I 11 21 31 41 51 
I I I I I I 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTAJDSKAARL 60 
RRIERWQATV KAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120 
PPAEPEPEPE PEPEPALDLA ALRA VACDCL LQEHF YLRRR RRVHRYEESE VTSLPFLDQL 1 80 
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EWFRANAJA SLFA WTGAQA MYQGFWSEAD VTRPFVSQAV 360 
ITDGKYFSFF CYQLNTLALT TQADQNNPRK N1CWGTQSKP LYET1EDNDV K.GFNDDVLLQ 420 
IVHFLLNRPK EEKSQLLEN 

Seq ID NO. 17 DNA sequence 

Nucleic Add Accession #: NM _025059. 1 

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons) 



I U 21 31 41 51 
I I I I I I 

GCATG AGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 
AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 180 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTG A AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAGA CAGAGTTCAG GAACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480 
AAGTTTCAAA GAATTGCAGG AAACATG AGO AATTTCTGAC TCAACTGCOT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATGAATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 
TCCATGAGAT GOAAGCAAAA GCTAGCAGAG AAACGATCAT OAGGCTGGCT TCAGAAGTCA 720 
ACAGAOAGCA GAAAAAAGCT GCCTCCTGTA CTGAAG AG AA AGAGAAGCTG AACCAGG ACC 780 
TGCTCAGTGC TOTAG AAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCA AAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTCTGAGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 
AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGOCGGGAAG 1080 
AAAGCAGGGA COGGATGGTC TCCCAGCTITj AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAG AAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 
TTTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320 
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTG CAGAG AAAGC TAAAGACACA GAAAGAGAG A CTGGAGAGCA 1500 
AAGAATTACA CATGAGCCTC CTCCGGCAGA AAATAGCCCA GCTGGAGGAG GAGAAGCAGG 1560 
CACGCACGGC CTTGGTGGTT G AG AGGG ACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1 620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCOAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860 
AAAGGGCCAG AAACATG ATA G AAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT G AACGTGACC AGCCTTGCTC TTCCTGATTA TG AAATCATC AAGTGTCTTG 2040 

354 



• 



WO 02/059377 PCT/US02/02242 



AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100 
CTACTGGGCA AG AG AGGCAC CCACAAGGCC ATTTACAGCT TCTTCATISA ACACTGTATC 2160 
TCTTGAGAGA GQTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATG ATATAGT GAGAATGCAT 2280 
5 CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAG ATAA ACGTCAGCAT TAAAAAACGC 2340 
C AAA AAAAAA AAAAAAAAGC ATTTTAGG AT CCAGAAGAAT TCCACCAG AT TGCATGAGTT 2400 
AGATTGGGAA ATGGG AGTGG GAGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAG AAG CTGTGACATT TTTAATTTAC 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

1 0 GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGG AA TTGTTTGCAT AGAGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTG AG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAG ACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAG ATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAG AAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATG A CAGATTTACA 3060 
AGTGTAAGCT ATATG ATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGG ATCA 3 120 
AACTTACATA GGTCTCAGGT CCTGTAAGAA ACTTGCCTGT TCTAACTGTT GCTACCAG AT 3 1 80 

20 TATATCTGGT GGTAATTGTT AATGTTTCAG CAQGGCTGQT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GG AGCCCAAC CCTATAAG AT GAAG AAATCC TATATAGTCT TATTCACCAA 3300 
TATATCCAAT ACACCCACAO CAATGGTACC TTTTTAAGAT CAGGATTTTA TTATGAATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420 
CTTA ACTTCA CTGAGAAAAA TGTGTTACTC TTTTGG ACAA TTTATCTTAT TTCTATCATA 3480 

25 T AAG ATG TAT TTTTTTATTG TCCTTAAAAQ AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600 
TCGTG AGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780 

30 CTACTGCACT CCAGCCTGGG CAACAGAGAG AGACCTTGTC TC AAAAAACA ACAACAACAA 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900 
AGAGATGAAA CAATCAATTG CAAGGGTCAA AACTAGATTG G ATCTTGGTT TGAATGAAAA 3960 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020 
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTG ATTGTTA TACTTTAAAT 4080 

3 5 AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 4140 
AAAAAAAAAA A 

Sea ID NO: 18 Protem sequence: 
40 Protein Accession #: NP 079335.I 

1 11 21 31 41 51 

MSLDCTSHIA LGAASPAPEE TYDHLSE VP V TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120 
RTEITAHAAI KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 180 
PDERNDKASD EDULKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTTLE KIREMDSREE 360 
50 SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTEQLVR LESNA V1ENK 480 
TIAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTAL WERDN AHLTIRNLQK 540 
KVERLQKELN TCRDLHTELK AKLADTNELK IKTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600 
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREWSQM 660 
5 5 LGLNVTSLAL PDYE1IKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 

Scq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: AF071552,NM^000662 
60 Coding sequence: 441-13 13 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTA AAAGGOAAAT TGAGTGGGTC AGGTACCACG GATACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAGACGTT TATAAGCCTA TTTTAAAGGA 120 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 180 
TTTG ACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240 
TTTCTCACAG GATTCTGGGA CTATTAACTO AACTTATOTQ T QTAAA AOGA ATTCATACAA 300 

70 TGAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT CTTTAAAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC TTTCGTTTTG 420 
TTTTCCTTGC TTAGGGGATC ATGGACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTGACAT TCTTCAACAC CAGATCCGAG 540 
CTGTTCCCTT TG AG AACCTT AACATCCATT GTGGGGATGC CATGGACTTA GGCTTAGAGG 600 

75 CCATTTTTG A TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGAOC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGCACTCC AG CC AAAAAA TACAGCACTG GCATGATTCA CCTTCTCCTG CAGGTGACCA 780 
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TTGATGGCAG GAACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840 
CTCTGG AGTT AATTTCTGGG AAGO ATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
5 TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1 140 
TGGTGGGCTT CACCCTCACC CATAGG AGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT TAOAATAAGQ 1320 

10 AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CX3ACCTATCA 1380 
TGTATCTTCT GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATT AAACA GCTTTTTAAA G AAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 

j ^ TGGAAATTGG TGATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Scq ID NO: 20 P rotem sequence: 
Protein Accession #: NPJW0653. 1 



20 1 11 21 31 41 51 
I I I I I I 

MDIEAYLERI GYKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL GLE AIFDQW 60 
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNY1 120 
VDAGFGRSYQ MWQPLEUSG KDQPQVPCVF RLTEENGFWY LDQIRREQY1 PNEEFLHSDL 1 80 
25 LEDSKYRKJY SFTLKPRT1E DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240 
HRRFNYKDNT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRFFTI 

Seq ID NO: 21 DNA sequence 
Nucleic Acid Accession H: NM 003714 

30 Coding sequence: 1 23- 1 031 (underlined sequences correspond to start and stop codons) 



1 II 21 31 41 51 

I I I I 1 I 

3 5 CGGCACG AGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 
CCATGTGTGC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TTTGGTGTTG GCCACCTTTG 180 
ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG CCGCCTGTCC CTGCAG AATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300 

40 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAGACGCC TTGAAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 
ACCTCAAGCA CO ACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTG ATA GTGGAGATGA 600 

45 TCCATTTCAA GGACTTGCTG CTGCACG AAC CCTACGTGGA CCTCGTGAAC TTGCTGCTGA 660 
CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGAAG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGOCTCOCA 780 
CGGCGCCCCC CG AGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAG AGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 

5 0 GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020 
TCCGGAGG TG AA ATGAAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTGGGOACTG TGGACTTCAT COAGOTGTGT GTTCGCOGAA CGGACAGGTG AGATGGAGAC 1200 

55 CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGO 1260 
OAGCQCGCCC GCGTTATCCT CGTACCTTTG T CTTCI 1 IC C ATCTGTGGAG TCAOTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG GAGGTG AACC AGGGAOGGGC AGGGCAAGGC AGGGCCCCCA 1 380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 

rp . GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 

oO AAATATCGCT TAGAATGCAG GAGAAGGGTG GAG AG GAG GC AGGGGCCGAG GGGGi GCTTG i560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAGAGCT CTTGGCGAGG 1620 
GTGG AGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTGTACTG TCTGGGCTGG 1 680 
GGGGGACACT GTCCAAGGOA GTGGCCCCTA TGAGTTTATA TTTTAACCAC TGCTTCAAAT 1 740 
CTCGATTTCA CTTTTTTTAT TTATCCAGTT ATATCTACAT ATCTGTC ATC TA AATA AATG 1800 

65 GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Rrg m NCb 12 Protein sequence: 
Protein Accession fc NP_003705 

70 

1 11 21 31 41 51 
1)1111 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFDCDA1KCX AHALRHRFGC 120 
75 ISRKCPAIRB MVSQLQRECY LKHDLCAAAQ ENTRVTVEMI HFKDLLLHEP YVDLVNLLLT 180 
CGEEVKEATT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG EROSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 
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Seq ID NO: 23 DNA sequence 
5 Nucleic Acid Accession*: NM 005264.1 

Coding sequence: 557- 1954(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

10 | | I I I I 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCOGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACG A GCATCCG AGC CO AGGGCTC T GCTCGGAA AT CGTCCT GGCC 120 
CAACTCGGCC CTTCG AGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 1 80 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 

1 5 CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTG AGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGG ATTT TTTTGGGGGG GCGGGGACCA 540 

20 GCCCCGCGCC GGCACCATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAG ACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTG AAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 

25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGG AGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACC ACC AG CG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1 140 

3 0 CCTCCGGCAG TTCTTTG ACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1 200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAG AAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAA AACT ACGCTG ACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 

3 5 AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GG GTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 

40 TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAA AA ACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 2040 

45 GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 

50 TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTG A TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA G AAAAAACCA CCCGGAATTC 

55 Sgq ID NO: 24 Protein sequence:. 

Protein Accession #: NP_005255 

1 11 21 31 41 51 

60 MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFR VVPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYTTPCTT 1 80 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 

65 YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL WTALSTLLS LTETS 

Scq ID NO: 25 DNA sequence 
70 Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1576(entire sequence represents open reading frame) 

1 11 21 31 41 51 

75 CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 

TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGG AGC CCCCCGCCCC 180 
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CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 3O0 
CCCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
5 TCCOGTGCCC ACCTCCGAGC ACGTGGCCG A G ATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCOGGAAA TCAT CTCAOC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC COGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGOCCGGCC AGGTGACCAT CCOTGTGCGG OTGCCCTACC GCGTGGTGGG 720 

10 GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CGTGGAGCGT GCGCGCG AGO AGATCG AGAC GCACATCOCG GTGCGCACTG GCAAGATCCT 900 
CG AGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 

1 5 GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGG ACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG I 140 
CAAGCAGOAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGOCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 

20 GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGOCAC ATCACAGCCA CGCAAOCCAT 1560 
CCG AAT ATTC TCCTAA 

25 

, Seq ID NO: 26 Protein sequence: 

Protein Accession #: FGENESH predicted 



30 1 11 21 31 41 51 

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 1 80 

3 5 FM VTGRREDV ATARREIISA AEHFSMIRAS RNKSG AAFG V APALPGQVTI RVRVPYRWG 240 
LWGPKGATI KRIQQQTNTY 1ITPSRDRDP VFEITGAPGN VERAREEIET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL 360 
GEQGGDFG YG GYLFPGYGVG KQDVYYG VAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 

40 VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI R1FS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1 -2070 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 

50 CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCG AGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGO AACAC AGG ACGGGG A GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

5 5 CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCC CTCCT 600 
CCAAGTAG AG CTG AAATGGG AAGGAACCCC TGGG ACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAG ATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

60 ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAA A GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG .840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAOCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTG ACATCAG GAGGACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGO GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

65 GACATGGAGA AGGGGGTTGA GGG AGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 
G ACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGOACCT GCTOTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

70 AGGCTG AAGG AGGGCTCCTC ACGG ACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGOG 1 380 
GGCGGTAGCO CCOACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGOACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

75 GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1 800 
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25 



30 



ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGG A GCCAGAGGCC CCAGGCAGCC 1860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 Protein sequence: 

Protein Accession #: FGENESH predicted 



I U 21 31 41 51 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQOHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
15 GGTQDGEPLQ TVLAH1AALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK. ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKG VEGGP FPSRCGNSSB 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
20 G ARWVC1NGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKS1SNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVOGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NMJH23 19.2 

Coding sequence: 1 38-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTCGTGCCGA ATTCGGCACG AG ACCGCGTG TTCGCGCCTG GTAGAG ATTT CTCGAAG ACA 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAOGCC 120 

35 GOGGAGACGA AGGCGCAATG GCGAGG AAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 
TCTCTGTCAC A AATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCGAG ACCACTGAG A 240 
AAATTAGTCC G AATTGGG AA TCTGGCATTA ATGTTG ACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 

40 ACCACG ACCA TCACTCAGAC CACGAGCATC ACTCAG ACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AG ACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA G ATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG G AGCTCACCG ACCAG AACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

45 TCTCTG AAGG AACTCACTTT CTAGAGACAA TAGAG ACTCC AAG ACCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTG AGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TG AAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTG AATGCAA CAG AGTTCAA CTATCTCTGT CCAGCCATCA 1020 

50 TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC AT TTCCAT CA 1140 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGciim 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1 320 

55 CAATGG AAAT GAAAAG AGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 
GTG CCT ATTT TGATTCCACG TGG AAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

60 GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1 800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

65 TGGCCTGG AT GGTGATAATG GGTG ATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTG AAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATG AATTA GGTG ACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

70 GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AAT QATGCT A 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT- 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTG ATTT 2520 

75 TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGG AG ATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
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TTTTCAACAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTOAA GAACTGCTGG 2760 
TGTTTAGGAA TAAOAATGTO CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAGAAA'TAAAGGAGAA AAGAGAAGAA TCTGAG AATT GGGGAGGCAT AGATTCTTAT 2880 
■ AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAG AATT AAG TATAAAAAGG 2940 
5 CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTA AC TTTGTATAAT ACAG AAATCT 3060 
AAATATATTT AATG AATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3 120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
1 0 CAAAATTATC AG AGTAGTAA AACTTTGATA TATATGAGG A TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CG ATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CGATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

15 Scq ID NO: 30 Protein sequence: 

Protein Accession #: NP_03645 1 .2 

1 II 21 31 41 51 
„ 1 I I I I I 

20 MARKLSV1L1 LTFALSVTNP LHELKAAAFP QTTEK1SPNW ESGIN VDLAI STRQYHLQQL 60 

FYRYGENNSL SVEGFRKIXQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHHR HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKG AHRPE HASGRRNVKD 1 80 
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAUNQIDA 300 

25 RSCUHTSEK KAEIPPKTVS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEDCKQL SKYESQLSTN 480 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHMPHSHSQ RYSREEUCDA GVATLAWMVI 600 

30 MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLXA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HKDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML USIFEHKIV FRINF 

Seq ID NO: 31 DNA sequence 
35 Nucleic Add Accession NM_002 184.1 

Coding sequence: 256-3012(undertined sequences correspond to start and stop codons) 

I It 21 31 41 51 

40 G AGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGG AG GGTCG AGGCG GCGCGGCCTG AGTG AAACCC AATGGAAAAA GCATGACATT 1 80 
TAGAAGTAGA AG ACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 
AAATATCCGC GCAAG ATGTT G ACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

45 CTCACCACTG AATCTACAGG TG AACTTCTA GATCCATOTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATOTATG 420 
GATTATTTTC ATGTAAATGC TAATTACATT GTCTGG AAAA CAAACCATTT TACTATTCCT 480 
AAGOAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AG ATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 

50 TATGG AATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAA ATC TGAATGGGCA ACACACAAGT TTGCTG ATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAG AG AA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 

55 CCTGTATATA AAGTG AAGCC CAATCCGCCA CATAATTTAT CAGTG ATCAA CTCAGAGG AA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 1 40 
GTGTTTAGGA TTCGCTGTAT GAAGG A AGAT GGTAAGGGAT ACTGG AGTGA CTGGAGTGAA 1 200 

60 GAAGCAAGTG GGATCACCTA TG AAG AT AG A CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATOATCGC 1440 
TATCTAGCAA CCCTAACAGT AAG AAATCTT GTTGGCAAAT CAG ATGCAGC TGTTTTAACT 1500 

65 ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGG AAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTG A TGG ACCAGG A AGCCCTG AAT CCATAAAGGC ATACCTTAAA 1 800 

70 CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 
QTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATG AA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAOAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

75 GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCGAG AC CT AATT AAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 22S0 

360 



WO 02/059377 



TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGG AAAAAA TTAATACTGA AGG ACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTG AT G AAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCG AGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAG AT 2640 
CTACAATTAO TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAOTACTTC 2700 
AAACAGAACT GCAGTCAGCA TG AATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATG A GGAAG ATTTT GTTAG ACTTA AACAGCAGAT TTCAG ATCAT 2820 
ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTG ATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAG T G AA GG ACT AG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTA AAATGAT TTTATCTGTG A ATTC 

Scq ID NO: 32 Protein sequence: 
Protein Accession #; NP_002 1 75. 1 

1 11 21 31 41 51 
I I I I I I 

MLTLQTWWQ ALFIFLTTES TGELLDPCGY ISPESPWQL HSNFTAVCVL KEKCMDYFHV 60 
NANYTVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCrVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 180 
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDP VYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSOC SVIIUCYN1Q YRTKDASTWS QIPPEDTAST RSSFTVQDLK PfTEYVFRlR 300 
CMKEDGKGYW SDWSEEASGl TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLT1PACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKY1LEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYUTVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF 1RNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IWPVCLAFL LTTLLGVLFC FNKRDUKKH IWPNVPDPSK 660 
SH1AQWSPHT PPRHNFNSKD QM YSDGNFTD VSVVEffiAND KKPFPEDLKS LDLFKKEKJN 720 
TEGHSSGIGO SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 840 
KEEDFVRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV ROGGYMPQ 

Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession NM0I 8255. 1 

Coding sequence: 1 1 -249 1 (under! tned sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I 1 I I 

AGTTGGCGAC ATGGTGGCAC CCGTGCTGG A GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGGA GTCCTGAACT GGAGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTGCTCTATG ACCOCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA ISO 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGO ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGC AGATTC 420 
TGCTGTTCG A CTCTGGTCTA AAAAGGGTCC AG AAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGGATT AGAGGAGTGG AATGOGCAGC 660 
CTTTGGTAGA GAT C1 1 1 IC C TAGCAAGCTG TTC ACAAG AT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATG AAAGTGT TAAAATAGCA TTTGCTGTTA CT CTGGA GAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTGAGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 
TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGOTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1080 
TGCTCATGCT TTCCACGGAG CGTTGCACCT TTGG AAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA G AG ATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA G AATTTATTA TCACTGTTGG TACTG ATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAOA AAAGACCAAT CACAGGTOAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGG AT TATCAAATAA AGCTGTCTTT CAGGG AG ATA TAGCTTCTCA 1 560 
GCCTTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGG ATCAT CTTCTGCAGA ATACTTTGTG 1680 
GCCTGAAGTT CAAAAACTAT ATGGGCACGG TTATG AAATA TTTTGTGTTA CTTGTAACAG 1740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GG AAACAGGT GCAG AATTTA GTTTTCCACA GTTTGACAGT 1 860 
CACGCAGATG GCCTTCTCAC CTAATGAGAA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2 1 00 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT OCTCAGTCCT 2160 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGGAT TGG AGTGTGG AAAGATTTGC TTATATACCT GG AAAAAGAC 2280 
TGATCAAGTT CCAGAAATAA ATGACTGG AC CCACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG G AAGAATTGC AGTGG AAAAA CTG AACAGAA 2400 
GGAAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
GATACACAGA GTCAATAAAT GTG CACTG2AATGG 

Seq ID NO: 34 Protein sequence; 
Protein Accession #: NP_060725. 1 

1 U 21 31 41 51 
I I 1 ! I I 

MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRW VTNLNGHTAR 60 
VNCIQW1CKQ DGSPSTELVS GGSDNQVIHW ElEDNQLLKA VHLQGHEGPV Y AVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 1 80 
GNDDCRJMF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CURIWKLYI 240 
KSTSLETQDD DNIRLKENTF TIENESVKIA FA VTLETVLA GHENWVNA VH WQPVF YKDG V 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMHAHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFHTVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE IARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGD1ASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYGHG YEI FCVTCNSSKT LLASACKAAK KEHAAHLWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNBKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNK1TSVHSR UWSCDWSPD SKYFFTGSRD KKVWWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQKEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM_022 13 1 

Coding sequence: 11-2878 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTOCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGG A TCGAGACTTC ATATCATGG A GTCATAACTG AG AACAATG A 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAG AGO GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGG AGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TAT AC AG GTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAG AGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTOAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGC AAG ACT GG ACCAAGAG GATTG AGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGGAG AC GTGCGATGGA GCCGTGTCTT CCCTCCAG AT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT G ACCGGGAGA CCTACTCTG A 960 
GAAATCCCTT CAQAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGG ACT GCTGGTGGAC AGCAGTG AG A TG ATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1 140 
TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTG A GAGCCG AGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCG GAAGG AC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTTGT TGGCAAGG AG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 
CATTAACCGT GCTCTCCAG A AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1 800 
TGTGCGGCGC CTCA A AGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAG ACG TATGCATCAG 1 860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA G ACCACTTCT GG AG ACCTGC TGCCCAGTTT G AAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT G AT ATCAAG A TTGTG AGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAO GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGG ATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATO AGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GOGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGG AGCAT GTCAATCATC TG ATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TO AGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTQCATGC TTGTGTTTOT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGG AG ACTG A GGCTGCCAAG G AATCTG AG A TGGACTGGGA 2640 
CGATTCTGCG CTG ACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGQCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAO GAAG AAATGA GCTCCAGCAG 2760 
5 TGGCTCTGAC GACAGCGAAG AGGAGG AGG A GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGG A GCC AGGCAAG CCCAGCTGGA GTGGG ATG AC TCCACCCTCC CCTACTAGTQ 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGA AGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 

1 0 " AAGAAGCCCA GCATGGCCAT CAGTG AGG AC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGG AGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 

1 5 TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CAC ACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCrr CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGC AGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

20 CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAG AAAATAC ACGCTTTCTG CATGTATTAG 3960 

25 AAACGCACG A GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGG AATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGG AACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTG AAGAG A A TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTT TTGTGGCCAA GAG AATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 

30 ACGAAGCTCT GAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAG A AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 

35 CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 
CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTOG 

Seq ID NO: 36 Protein sequence: 
40 Protein Accession U: NP_07l414.1 

1 II 21 31 41 51 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG V1TENNDTVI 60 
45 LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAWLNKT SGEGRLRAKS PIDCELQKEY 120 
TFIIQAYDCG AGPHETAWKK SHKAWHIQV KDVNEFAPTF KEPAYKAWT EGKIYDSILQ 1 80 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
50 RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKXDQ1C DKEWHYYVIN VEFPWTLYM DG ATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGDCYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
UCVSSKVQCF GEDVCISIPE VDA YVMVLQA IEPRITLRGT DHFWRPAAQF ESARG VTLFP 660 
55 DIKT/STFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL V1GGDLDPRQ ECLELNHSEL 720 

HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWIHS 840 
VCMLVFWAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

60 

Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 143-874 (underlined sequences correspond to start and stop codons) 

65 1 11 21 31 41 51 

GGGAGGGAG A GAGGCGCGCG GGTG AAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGG AG CCAG ACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG ! 20 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

70 CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GC AG AGGG AG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGG A TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

75 ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATOCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
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AGCTATAATT TATTTGG ACC AAGG AAGCCC TOAAATG AAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGG AAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGA ATTCAGT 840 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
5 TTTTTTTATT ATGCCTTGG A ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACA CTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1 140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
1 0 TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAA A AAAA 

Seq ID NO: 38 Protein sequence: 
^ ^ Protein Accession U: none found 

1 U 21 31 41 51 

MRPQGPAASP QRLRGLLLLl'lLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKOCS WSSLNYG1DL 120 
20 GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNG AECSGPL PIEAIIYLDQ 180 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSR111EE 240 
LPK 

Seq ID NO; 39 DMA sequence 
25 Nucleic Acid Accession ft NM_000949 

Coding sequence: 285-2 1 53(undcriined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

30 GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 120 
ATGTCCTCGT GCAGGAAGTA CTCATCG ACT GATGTGGCAG ACTTTGCTCC CTG ACAAAAC 1 80 
TAAAG AACTC TCCTATTCAT GG AGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 

35 CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGG ACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGG AGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 

40 CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGO AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TA AATGGTCT CCACCTACCC TG ATTGACTT AAAAACTGGT TGGTTCACGC 780 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 

45 TTCGCTGCAA ACCAG ACCAT GGATACTGG A GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 

50 ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 

55 TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1 620 
CACCGGCCAC TCTGTTGAAT G AAGCAGGTA AAG ATGCTTT AAAATCCTCT CAAACCATTA 1680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTfiACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTOGC TCCGCTAAAC 1 800 

60 CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA I860 

AACAGAGAGA GAACAGCGGC AAGCCC AAG A AGCCCGGGAC TCCTGAG AAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTG AACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 

65 TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TCAJAGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATOTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG A ATGTAGTCA G ACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTG ATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 

70 AAAAGCACAC TGCTTAGTAT TCTTGAGGG A CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TT ACCAT AG A 2520 
AAGATG ACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATAACCATTT 2640 
AAAG AAAGAA TGCATTCCAO AACAAATTGT TTACATAAGT TCCTATACCT TACTG ACACA 2700 

75 TTGCTGATAT GCAAGTAAGA AAT 



Seq ID NO:40PjpJcjQjejauence: 
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Protein Accession U: NP_000940, 1 

1 It 21 31 41 51 
I I ! I I I 

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTUDLK TGWFTLLYEI RLKPEKAAEW 1 80 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLSAVICL nVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGK.SE ELLSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGS AKPLDYV EKKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession it: none (bund, Eos cloned sequence 

Coding sequence: 1 -1 572 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTGA AGTGCCTGAT 60 
GO AGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 1 20 
ATCATCAAG A CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAATCCAAT 1 80 
AGCAGGATCT CATGG ATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TG AGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGG ATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTG A AGGAGCGCAT TGGCTGG AGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AG ACCCATCT TTATCAGAGG ACCAGCGTCA CCG AAAATAG TCATCCAGGA AAATCGG AAA 660 
GAAGCGCAGT ATATGCTTGA AAATG AGAAA ACACGAACCT CAATAG ACTC CATTGACTCA 720 
GGAGTAGAAC TAACTACCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAG ACTTCTC CATTTTGAAA GAG AAAAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATO 1560 
G AGCCGGTA T G A 

Seq ID NO: 42 Protein sequence: 

Protein Accession #: none found, Eos cloned sequence 

1 11 21 31 41 51 
I I I 1 I I 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRJSWIISIC VFVLTFSAPL ATVLSNRFGH RLVVMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FA VFAFAPAI MALKERIGWR 1 80 
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKJV1QENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIEPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKIY 360 
IEUCVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DVVGIEKMSS 420 
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRFCKMGLCQ 480 
HHHSGETKW SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession it: FGENESH predicted ORF 

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA G AGCACAGTA CACTCCCGGA 60 
GGAAAGAAAC TTCCGTGGG A GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGG AG AG CCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACGAC AGCCCCGGGG GCCTTCTG AC TGCATCCCG A G ATTTCCATC AGCG AGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGG AA AGCGCCGCGG GACAGTCAOT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGG A GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGG AA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCO CTTCG ACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCG ACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAO GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTOCCGGGGC TGCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCTACOTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACG AAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTG AAC AG AAG ATGTA CCCAGCGGTG 1 740 
ACTOTATAAC CG AG AGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1 800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAG AATTTTA TAAACTGATA AATTAAGG AT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTOTTGTTOA CTGCACA GGA TCAAAATGCC TGTTATCTCC CTTTTACTQO GA C1 rTTTTT 2160 
inn run TTTTTTI l AA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGC CTGTC T CAGCCT CCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGeCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAG ATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAG ATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTG AAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAG AACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 . 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTG CAAG T AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAOTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATOATACTCA 3600 
TTTTTTAATA ACCAAAGGCA GGGG AAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence: 

Protein Accession »: FGENESH predicted 

1 11 21 31 41 51 
I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP 1YVPFLIVGS VFVAF1IU3S 420 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPM1PSASTS RGSSSRQSST AASSSSSANS 480 
G ARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Seq 10 NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_002285 

Coding sequence: 55-3738 (underlined sequences correspond to start Bad stop codons) 
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1 U 21 31 41 SI 
1 1 I 1 I I 

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACCAT£GAC 60 
AGCTTCGACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 
TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAG AGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAG ACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGG ACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 
AAACCCAGCC TGGTCCAGCA GAAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780 
CAGGCCCCTG ATG AGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGG AGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAGAAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 
AAAGTGGAAC CAACCAAATT TCCATTTCCA AATA AGG ACT CTCAGCTTGT ATCCTCTGGA 1080 
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1 140 
ACATCAATGC TGGAAGATG A CCTTAAGCTA AGCAGTG ATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 
AGCTCCTCCA GCG ACTCAG A GAGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 1380 
AGCGAG AGTO AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA ATGAAAGCCA CGGGTCAGAG AGCAATC AGT ACTACAACCC GGTGAAAGAG 1 560 
GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620 
ATCAAGAGCA CTTGCAAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 
GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTG A CCTGCG AG AA GCGCCGCACG CGGGGGCTAA GCAGGATCOT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATXXTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AGAGGCTG AA GG AGGCCGCT GCCAACGGGG GCAGTGGT(X TAGGGCCCCT 2220 
GTAGGCTCCA TCAACGCCAG G ACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTA AAGGA CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCA A AATCX3 ACCTG ACCCTCCTGT CCAGGATCCC AG AACACCTG 2400 
CCCCAGGAGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAG AG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 
TGTGACAACG AAGACGACTA CAGGG AG ATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAA AAAT G A AAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAA ATACACC AGCGAGGACT TAACTTCTTC CAGCCGACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 
CTGCAGCCTC ACGGCGG AG A CCTCACGAAA GCAGCTCACA AC AATTCTG A AAACATTCCC 2880 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAG AGGC AGAAACTTGT CTTCGATG AT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAAC G AATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3120 
ATGGAATCCA AATCTCCTTA TTACCTGATG TATTCAGAAA CAGTAGAGCT CATCAGGTAT 3180 
GCTATGAG AC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
C AAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 
CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CG AC A ACCTG 3600 
GCCAAGG AAA ACCGAGAATT CTTCAACGAC CTGG ATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 
AGCCCCCACC TGTCA IAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GA AACTCCCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACAT TTCCTCA 

SeqIDNO: 46 Protein sequence: 
Protein Accession #: NPJD02276 

1 11 21 31 41 51 
I I I I I I 

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
KGDELSNRJQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120 
QPSSICSTTT STPAAVPVQQ SKRGTMGWQK AGHPPSDGQQ RATQQGSLRT LLGDG VGRQQ 180 
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240 
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QDQAPDESPK LKSSSETSVH CTSYRG VPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300 
CVEEIIREMT WLPPLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKOD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAWQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
PIUQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEIKSTCKEE QRPRTANKAP 540 
GSKGVKQKSP PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVW PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRIVP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780 
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EHCKSQGEKD 840 
SSSRLATSTS NTLSANHCNM NINSVA1PIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 , 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK. ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 
RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1 140 
SALSPSTIVS IPQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: NM_033 151 

Coding sequence: 35 1 -4499(underiincd sequences correspond to start and stop codons) 



I 11 21 31 41 51 

ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGGAGATT 60 
GCCCAGGATC A AGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 1 20 
ACTAAGTGAT TTGGGCCTG A GGCCTGAG AA GATGTTTAAA AAGAGGGATC AAGCACAGGC 1 80 
TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAA A ATGACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCG ACATAG 420 
GCGATG ACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAG AT GGCCCCTGGA 480 
GTCAGCAAGA 0 AG A AATCCT G AGGCTCCAG GG AGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTG AGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TG ATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GG AAGAAGAA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA G ATCCTGGA A TATTCAGAAG AGCAGTTGGG OAATGTTGTC CATGG AGTGG 960 
G ACTCTGCTT TG CCCTTTTT CTCTCCG AAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GG AG AAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGG A GAAGTGCGGG CTTGTCCAGA 1500 
GCCTG ACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1 560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GG AGAGCCCT GTTTTCTATG 1 740 
TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT I860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GG AAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTG AATC GGGACCTGG A ACTTCTGCCC TTTGG AG ACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
TGG ACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT T AAGAA GACA CTCAGGGGGA 2400 
AG ACGGTCGT CCTGGTG ACC CACCAGCTGC AGTACTTAG A ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGG AAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGOTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT G AGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAOCTCTTC AAC AAGGT TT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
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CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3 1 80 
TGTCCTTAAT GGTGATCGCC GTCCTGTTG A TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGG AGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
5 ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTG AT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 
1 0 AGTTCACGGC TGTAG AGAGG ATACTGCAGT ACATGAAG AT GTGTGTCTCG G AAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG A AGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGG AGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
1 5 TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAG AT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
20 CCACAGCCTC CATTGACATG G AG ACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGG A 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGAJAAG 4500 
GAG ATGTGGA G ACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
2 5 TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGG AGATGAG AACTTCTCCT GG AAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAACCCC AGAACCATCT AAGACATGGG ATTCAGTGAT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 



Seq ID NO: 48 protein sequence: 
Protein Accession U: NPJ 49 1 63.2 

3 5 MTRKRTYWVPNSSG GLVNRG IDIG DDM VSG U YKTYTLQDG P WSQQ ERNPE APG RA A VPPWGKYD AA LRT 
MIFFRPIO>RFPAPQPLDNAGLFSYLTVSWLTPUvlIQSUlSRU}E 
VSRRGIFX^SVLLVMLRPQRTRLIFDAlXGICPCIASVLGPIUiraLEYSEEQLGNW 
I^ECVKSLSFSSSWONQRTAIP^RAAVSSFAFEKXIQ 

VLITCASLVICSISSYHIGYTAFIAIU^LLVFPLAVFMTRMAVKAQHHTSEVSDQmVTSEVLTCK 
40 UKMYTWEKPFAKHEDIJfUUCERKIAEKCGLVQ 

UVSLNIXRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVFYVQTUJDPSKALVFEEATI^ 
VNGALEI^RNGHASEGMTRPRBAl^PEEEGNSUjPELHKINLWSKGMMLGVCGhTTGSG^ 
MHLLF^ SVG VQG SLAY VPQQAWIVSGKlRE^UvlGGAYD 

GLNl^GQKQRIStJ^RAWSDRQIYLIX>DPLSAVDAHVGKinFEFX:iKKTLRGKTVVLVTHQLQYLEFCG 
45 QIUXENGKJCENGTHSEU^QKKGKYAQUQKMHKEATSDMLQDTAKIAEKPKVESQA^ 

AVPEHQLTQEEEMEEGSl^WRVYHHYIQAAGGYMVSCUFFFVVUVFLTIFSFWI^YWLEQGSGTNSS 

RESNGTMADLGMADNPQI^FYQLVYGI>JAIXUCVGV<^ 

FDTfflGRLLNCFAGDLEQUXJIXPIFSEQFLVLSUW 

KKAlGVFKRLEWSRSPIJSHIU^SLQ^I^SIHVYGKTEDnSQ 
50 EIMTr^VTLAVAIJVAFGISSTPYSFKVMAVTflVU}^ 

FJVPLiiMEGTSCPQGWPQHGEIIFQDYHMKYRDm? 

LVEPMAGRIUDGVDICSIGLEDUlSKLSVIPQDPVLI^CrriRFNLDPFDRHTDQQIWDALERTFLTXA^ 
SKFPKKLHTD\A r ENG GNFSVG ERQLITIARA VLRNSKJILfflEAT ASmMETDTUQRTIREAFG^CTVL 
^ ^ VIAHRVTTVLNCDHILVMGNGKVVEEDRPEVUIKKP^ 

Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: NM 033419 

Coding sequence: 1 8-980 (underlined sequences correspond to start and stop codons) 
60 1 11 21 31 41 51 

CGAGCCAGGG AGAAAGOATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCG AAG A GCAG AACTGC TCTGGGGGCG CTCTGA ATCA CTTCCGCTCC CGCCAGCCAA 1 80 

65 TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCQ GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

70 TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACA ACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 

75 AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGG AAGATG ACAGCCTGTA CCTGCTG AAG GAATCAG AGG 960 
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TC 
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ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCG AGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATGAAGGAT GTGOGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 
GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGOCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCOCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 
GGAGGCCTCA CCATACTCCC TTTCAGGGOC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAG A GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1 800 
GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTO GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAO TGCCCTCCAG AAGCAGCCCC 2100 
TCGGAGGCAG AGGAAGG AAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGG AGAAGGG GAGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAOGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 
AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTG ATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATC AAT TCCAAGCCTC AAAAAAAAA A AAAAAAAAAA AA 

ScqIDNO: 50 Protein sequence: 
Protein Accession m NP_2 19487. 1 

1 11 21 31 41 51 
I I I I I I 

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 1 80 
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 



Scq ID NO: 5 1 DNA sequence 

Nucleic Acid Accession #: XM_05909 8. 1 

Coding sequence: 178-61 8(underlined sequences correspond to start and stop codons) 

I II 21 31 41 51 
I I I I I I 

G ATGTACACT CTGAAGTG AG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTO AGC ATCAAGAGAG ATGCCGGGAG 120 
. GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAGATG 180 
TCGTACACCA CAATGTGCATCAAGGAGACG TGCCGATTGA TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGG ATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTG AGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 
TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAG AGTG A CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600 
AAACTCTCTG AATGTTAQAT CTCAGGGTAC AATGATTAAA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 
G AGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCA'I ITCTAG GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AG AATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAG A ATAAAAATTT TAAATCTCAC TTCACTTAGC CG ACATTCCA 1020 
TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080 
GACTTTTTCC TATACATTTT ATATGTAG AA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTGAT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence: 
Protein Accession U: XPJ)59098. 1 

1 11 21 31 41 51 
I I I I I I 

MSYTTMCIKE TCRUPAVPS 1SRDLSKPLT FPDGCTLPAG ITWLSIWGL HHNPAVWKNP 60 
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN C1GQEFAMIE UCVTIAULL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KKLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession U: NM_O309 16 

Coding sequence: 1-1533 (underline d sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGC CCCTGT CCCTGGGAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 1 80 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCO AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTG ACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGG ACC AAAGGATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTG A GGCCTCTGTG AGGGGCCTTG AAG ACCAAAA TCTGTGGCAC 780 
ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTG AGTGAAG GGCAGCCCCC TCCCTCATAC 840 
AACTGG ACAC GGCTGG ATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 
TTCTGCCTTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Protein sequence: 
Protein Accession U: NPJ 12178.1 

] 11 21 31 41 51 
I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TVVLGQDAKL PCFYRGDSGE 60 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 1 80 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 
HVSFLAEASV RGLEDQNLWH IGREG AMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWOVIAALL 360 
FCLLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSDRR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGKQ 480 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF0071 70.1 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AG AAGCACTC 120 
AOCTACCTC A AGCCCAG AAC CAAGG AAAGC ATGTACCACT CACTG ACATA TGCCACCATC 1 80 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGG ACA TCCTGCTTGC CGGCAACATG 240 
ATG AAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGG A AGTCTTCTGT AACAG ATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
G AGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTG A GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
G AGCTGGACA GCCTTGTTCA GTCCTCACAA TACTG CAAGG GTGAGAACCA CCCGCACTTT 540 
G AAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTG TCC AT GCTTCCTACT 600 
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGG AGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTG AAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTG ATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAG AA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGG AAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAG AG ATGCTGG AG A AAGGCCCAGA GAACGAGTAC 1380 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTG ATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGGACACA CTTTCG AATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGO CACAACATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAG AGCAG GTGG AGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1 980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2 160 
TCAGCAAGTT CTCAGCTTGT GTG ACGG AAO GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAG ACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTG AG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGG AGGA GGTTCTCAG A ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATG AAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTG AAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protcm sequence: 
Prolan Accession #: AAC39582.1 

1 11 21 31 41 51 
I I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEBIHAEVCY AECLLQRAAL TFLQDENMVS 120 
FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGG V KXGVGAFNLT LSMLPTRILR 1 80 
LLEFVGFSGN KDYGLLQLEE G ASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVHCGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
UPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NMJ006670. 1 

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGG AC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTG ACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTG A CACATCTAG A AAGCCTCCAC CTGGAGG ACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAG AC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGG AAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTOCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAG AGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 58 Protein sequence 
Protein Accession #: NP.00666 1 . 1 



1 II 21 31 41 51 

MPGGCSRGPA AG DGRLRLAR LALV LLGWVS SSSPTSSASS FSSSAPFLAS AVS AQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELOGL PH1RVFLDNN 180 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYVFLGIVLA UGAIFLLVL YLNRKGDCKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 



Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession #: NM 024022 
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Coding sequence: l-1362(underiined sequences correspond to start and stop codons) 
1 U 21 31 41 51 

5 ATGGGGGAAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCOATC GCTTTTTGGC 60 
CTTGATGATT TGAAA ATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTGAA GTTTTTTCCA ATCATCGTCA TTGGGATCAT TGCATTGATA 180 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCGACTGCT CAGGGAAGTA CAG ATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTGATAGCT CG ATGTGACG G AGTCTCGGA TTGCAA AG AC 300 

10 GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT OCAGGTGTTC 360 
ACAGCTGCTT CGTGG AAGAC CATGTGCTCC GATGACTGG A AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACTGGGTTT CCCA AGCTAT GTOAGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 
GTGACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACGTGGTT 600 

15 ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGG ATCATCA CTGCTGCACA CTGTGTTTAT 780 
GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
CCAGCCCCAT CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GCCAAAG AGO 900 

20 CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960 
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACO 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 

25 GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260 
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 1320 
G ACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACCT£A 

Seq ID NO: 60 Protein sequence 
30 Protein Accession* NP_076927 

1 II 21 31 41 51 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP UVIGIIALI 60 
3 5 LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 
VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
LCGGSV1TPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 
40 VPLISNKICN HRDVYGGHS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
G1GCAEVNKP GVYTRVTSFL DWIHEQMERD UCT 

Seq ID NO: 61 DNA sequence 
Nucleic Acid Accession #: NM 006475 

45 Coding sequence 28-2538 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

AACAG AACTG CAACGGAGAG ACTCAAGAT£ ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

50 TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATG ACAAGAT CTTGGCTCAT 120 
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA G ATTTTGGGC 180 
ACCAAAAAG A AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACG ACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 

5 5 ACAACGCAGC GCTATTCTG A CGCCTCAAAA CTGAGGGAGG AGATCG AGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCTTGGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAG AGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 

60 ATCCATGGG A ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTG ACCG TGTGCTTACA 720 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAG AAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTG AGAAA CTTCCACG AG GTGTCCTAGA AAGGTTCATG 900 
G GAG AC A A AG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

65 TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTG ACG GTG ACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1 140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTG AG GOCAGATGG A GAATACACTT TGCTGGCACC TGTG AATAAT 1260 

70 GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGG AAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

75 TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAG ACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTG AAG A AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 

373 



WO 02/059377 



TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGG AAGC 1 800 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA I860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAG AAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGGAGTGCCT QTGG AAATAA CTGAAAAAG A GACACG AGAA 2280 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAG AAA CTCTGAAGAA ATTGTTACAA G AAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
G AAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTG A GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
G AAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 
AATAACCTGA CCTTAGAAAA TTGTGAG AGC CAAGTTG ACT TCAGGAACTG AAACATCAGC 2640 
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTOTGGTAA AGO AATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTG ACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AGAA AAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
CCATTG AAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGG AAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 
TCAAAAGGCT TTGCACATTT CTATATG AOT GGGTTTACTG GTAA ATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTG AGTA 3180 
ATTCAG AAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGG AA 

Scq IP NO: 62 Protein sequence: 
Protein Accession ft NP 006466 

1 11 21 31 41 51 
I I I I i I 

MIPFLPMFSL LLLLIVNPIN ANNHYDK1LA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 1 80 
NGMIIPSMYN NLGLF1NHYP NGWTVNCAR IIHGNQIATN GWHVIDRVL TQIGTSIQDF 240 
IEAEDDLSSF RAAAITSD1L EALGRDGHFT LFAPTNEAFE KURGVLERF MGDKVASEAL 300 
MKYHJLNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGKMVNKKD IVTNNG VIHL 360 
1DQVUPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLUCLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 
RNGAIHIFRE IDCPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTN1LKTTOG SK1FLKEVND 600 
TLLVNELKSK ESDIMTTNGV IHWDKLLYP ADTPVGNDQL LEILNKLQCY IQIKFVRGST 660 
FKEIPVTVYT TKHTKWEP KUCVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRLDCEGETI 720 
TEVIHGEPfl KKYTKIIDGV PVEITEKETR EERIITGPE1 KYTRISTGGG ETEETLKKIX 780 
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: NM 020974 

Coding sequence: 81-3080 (underlined sequences correspond to start and stop cottons) 

I II 21 31 41 51 

1 I I ! I 1 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCO CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120 
CGOTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCG ACGCC CTGTGTCAG A ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGOGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GG AAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGG AGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGO 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AG AG ACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACtC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG G7GGATGGGG 900 
ATAAAOGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 10 80 
OTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATG A GTGCTCTTTG GATAGG ACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGO AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTOT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
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GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1 560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGG AGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800 
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 
AGGCCGTCCA CAGGQAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACG CCAGG CAG AGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GG AGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGG A AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 
TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGOT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT G ACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTG ACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAG AACC ATCAGG AAAT ACTTAAGGAT AAGAAACTTA 2940 
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA T7TCAAGTAC ACAGCCCAGG 3000 
AGTCCOGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACCTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGG ATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTG AG AC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AG AATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 
ACTCAGTTTC TCC ACAGCCT TCTCCAGCCT GTGTG ATACA AGTTTGATCC CAGG AACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

Scq ID NO: 64 Protein sequence: 
Protein Accession #: NP J>66025. 1 

1 11 21 31 41 51 

MGVAGRNRPG AAWAVIXLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180 
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFFTVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS G AUCTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRJUV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE D1VRDGRLYA SENHQEILKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Scq ID NO: 65 DNA sequence 

Nucleic Acid Accession #: NMJ107210 

Coding sequence: 1-1869 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

ATGAGGCTCC TCCGCAG ACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60 
GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 
CCGTGGCTG A AGTCCCTGGT GAGCCGG AAG GATCACGTCC TGGACCTCAT GCTGG AGGCC 1 80 
ATG AACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAG A AGCCC AG CAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TOTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAG A GGTCCCTGGG GCCAGACACC 4«0 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600 
CTACACACCA CCOCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
G AGGAGCACC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGG AG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TGAG AAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTOC 1080 
ATCCCCAAGT CCTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1 140 
GGGGAGAACO TGGAAATGTC CTTCCGGGTG TGGCAGTGTO GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC COCACACCTT CCOCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAG AA ATCCTTCGGT 1380 
GACATTTCGG AACG ACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1 440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTG A CGCCCACCTT CTATGGTGCC 1500 
ATCAAGA ACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AG AACAACCG CGGGGGG AAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 
CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGG AACCT GATCAOCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 
AACCCACTGG GTGGCAAGGC AGG ACCTTCC TAATOCTTGC AACAACATTG GGCCCATTTT 2040 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG GAGAATGGCT TGAGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 
AACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAG AAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTG AGCCTT G ATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA A AAA AG AAG A AGAAAAGTAC TTCTACAGCC 2520 
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 
CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
A 

Seq ID NO: 66 Protein sequence: 
Protein Accession #: NPj0O9l 4 1 

1 11 21 31 41 51 
11)11) 

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAEUC PFWERPPQDP NAPGADGKAF 120 
QKSKWTPLET QEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 180 
VITVFHNEAW STLLRTVYSV LHTTPAILLK EULVDDAST EEHLKEKLEQ YVKQLQWRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PLLARIAEDK TVWSPDI VT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPK SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLBH PCSWGHVFR TKSPHTFPK.G 420 
TSVIARNQ VR LAE VWMDSYK KIFYRKNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480 
LHNVYPEMFV PDLTPTFYGA DCNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTO KNSQVPKDEE WELAQOQUR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #; NM_014 1 12 

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
! I 1 I I 1 

TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT OACGTAATGC 60 
GCGGAG ACTG AGGTCCTGAC AAGCG ATAAC ATTTCTGATA AAG ACCCG AT CTTACTGCAA 1 20 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTI TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCAOCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTC TCTC 300 
TCTCTTTTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGG A GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AG AAG ACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTG ACGATTA ATCA ACAG AT 480 
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTG AG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTG AC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 
TQCCTTATG A AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAGAAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GOTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAG AATTCT CTGCAGATCA G ATGTCAG AA AATACGG ATC 780 

376 



WO 02/059377 



AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAGCAG TAAGAAGG AC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGG A GGAAACTTTC CCTCCTTTCC GCATG ATG AG GTG ACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTGAGCCC TTGAAGTCTC 1020 
CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 140 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTGCCTTC AGATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 
GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGC CCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1 500 
CTGGTGTGCT GCAGG ACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1 560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 
GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1 680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAGGT TGCAAAAOCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800 
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860 
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920 
CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGG A GCCGAGGATA ATATGGTAAC OAGCTATAAT TGTCAGTTCT 2220 
GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGG ACT TTGCAGCCCA G AAAAGCACC TTGGAGAAAT TACTTATCCO TTTGCTTGTA 2400 
G AAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAGATG 2520 
TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAG AGAT TTCCCGACAC TACAGGAGAG 2700 
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCGAAG 2820 
AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880 
ATCTGCTAAC TCCAG ACTCT AAAATGGGAG AGCCAGTTTC TG AG AGTGTG GTGAAGAGAG 2940 
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG G ACCG AG AGT TCC AGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3 180 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 
CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
G AACAAG AAA GCGCCTTAAC CCAGAGG C AC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TG AGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAG AGA G AAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
A AAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TAATGACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGG AAT CCGCACTACT 4020 
TG AGTCACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080 
ATCTGCCTCC TCA'l 1' T T T CA GCTGTTGGAT CAG ACAATG A CATTCCTCTA GATTTGGCG A 41 40 
TCAAGCATTC CAG ACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACG AAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260 
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320 
TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560 
GACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 
GTTATTCTTT CTTTTGATGA GTAGGTTACC AAG ATTAAAA AGTG AGATAA ATGGTCAATG 4680 
AG AAAGAATG GAAGATGGTA AACAATCACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GG ACTGGCCA TTGGACCATT TGTTCCAAAA CCCATAAATT GTTGCCT AAA 4920 
TTTATAATGA TCATG AAACC CTAGGCAG AG GAGG AG AAAT TG AAGGTCCA GGOCA ATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040 
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTGATATTTT 5160 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
CCACACATGT ACCCATTTTA AAAAACAATG TCCTCG ATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TA ACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
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ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAG AACACAT GGCATTTCTT 5460 
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACOTTAC CACAGACTG A CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAO GGCTAAATAA CTG AACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTGCCAC GGAGATGTAG 5880 
AAG ATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 
AGAACAAATA GG AAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000 
ATAATGGTGC CTG ACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120 
CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAGAT ATCCTTTTGT GTGTGTGTGT 6180 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATG ATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATOTTT CATA T GAAACATTT 6420 
CAG AAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTG CATTC TG 6480 
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 
TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 
AGAGTTCAAC CAGTTCX5TG A CTGTTGTAGC GAATG AAGTT AAAAAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAAAAAA GAGAAAAGAG 6720 
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 
AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 
GG ATGCATCT CAAAATGTAT AGCCAGACTT G AG AGGTG AC AATTAAAGAT CTAAAAAAG A 6960 
G AGGAG ATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 
TCGTGGCAAT CCTTAAGCA A CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAG A 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAG 7140 
CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC A ATTTTl 1 1 T 1 1 1 1C TCAGA CAGACATAGT AGTATCAACT AGCAT TGGAA 7260 
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATA TTTTT 7320 
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 
CA' llllHO C TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTG AAA 7440 
GAGACAAAAA TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATT AAA GCC TTTAT TC 7500 
CATCTCTCAA GATATATTAT AAAATTTATT TTTTTAATTT AAG ATTTCT O AATTATTTTA 7560 
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGG AA CTTTTTTTAA TGAGGAATTT 7620 
CATGATGATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTGATGA AAATG ATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT G AAAATATTT TAAAATTATT TGTCTATATT 7740 
CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAG ACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTG ATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT GAGATTTTTT TTAATCAACT TTTAAGATAG 8040 
TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTT ATAA AACAAGC ATG 8100 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGT GCACA TA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATAC ATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 
CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT GTGAACTGCT 8400 
GCCCTGTTTA OTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CATCCAGTGO 8460 
TGGAAATGTG GAGTAAAG AA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTG AG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580 
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640 
TGTCCAAGGC AC AAG AG AAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAG GCAAC T GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTA TTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 
TTTCATAGAC AGTGCGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAQ AAAGCATCCC 8940 
AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTT ATCAGA GAAAATTGTC 9180 
CTTCTAG AGC ATAGAGTCCC AAAACTCAAT TCTGGTTTTC CCCTGTTTTT TTTTTTTTTT 9240 
TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTAAAAT TGAAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GTAAAGCACT TTGAGATCCTTGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
TGAAGATGCA AAACTATTAC TAGGCTG ATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAAT AATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA G AAAATGTCT TTCATATGG A ATG ATTTCAT GTTGCTAAG A 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTG A ATACTAGAAT GTT GGCTATA ATAGTTCTGT 9840 
TCTTACAACA CATG AAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 
TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TG AATAATAA AGTTTTACCA CCAGTTACAC A 
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Scq ID NO: 68 Protein sequence: 
Protein Accession U: NPJ>5483 1 

5 1 U 21 31 41 51 

MPYEVNAGYD FTNM VRKKNP PLRNVASEGE GQILEPIOTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 1 80 

10 QSGQANCQGL SPVSVASKNP QVPSDGGVRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240 
GYGYYGNDPT DUKHFRKYH LGLHNRTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSOGTFI GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELEQHFLQT HPNKJKASLP SSEVAKPSEK NSNKSIPALQ SSDSGOLGKW QDKITVKAGD 420 
DTPVG YSVPl KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKIXEHYGK. QHGAVQSGGL 480 

15 NPELNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540 
CDFRYSKSHG PDVIWGPLL RHYQQLHNIH KCTIKHCPFC PRGLCSPEKH LGEITYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEE1SRHYRR AHSCYKCRQC SFTAADTQSL 720 
LEHFNTVHCQ EQDITTANGE EDGHAISTDC EEPKIDFRVY NLLTPDSKMG EPVSESWKR 780 

20 EKLEEKDGUC EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETKGFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLN1IK QNNGEQIIRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREEP LPSLSKYEAQ 1020 
GSLTKSHSAQ QPVLVSQTLD IHKRMQPLHI QDCSPQESTG DPGNSSSVSE GKGSSERGSP 1080 

25 1EKYMRPAKH PNYSPPGSPI EKYQYPLFGL PFVHNDFQSE ADWLRFWSKY KLSVPGNPHY 1140 
LSHVPGLPNP CQNYVPYPTF NLPPHFSAVG SDNDIPLDLA IKHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NWKTEKVDR STQDELSTKC VHCGIVFLDE VMY ALHMSCH GDSGPFQCSI 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

30 Seq ID NO: 69 DN A sequence 

Nucleic Acid Accession »: XM_073879 

Coding sequence: l-387(underlined sequences correspond to start and stop codons) 

35 1 u 21 31 4 | 51 

ATfiGGGrrTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120 
OCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCGA GAGTTTATAC 180 
CACTGCGGCT TTGAACACGG G AGCGTG ATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 
40 GACAGG AAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 
GTCTGTCCTG CACACCACTC AACACAG AGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACIAA 

Seq ID NO: 7Q Protein sequence: 
45 Protein Accession*: XPJ>73879 

1 11 21 31 41 51 

MGFGDQGTVE GSLGTSKKPP EVKMFG ASQG LLTMETNQSL AQGTGCSWK VDTVLFESLY 60 
50 HCGFEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Scq ID NO: 71 DNA sequence 
Nucleic Acid Accession #: Afl 033064 

5 5. Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 

60 CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGG ATAG CTTTCTTATT TTGCCCATGT 180 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
G AGTGACAAT AAATTAGCCC CUC 1 T11U CCCTTGTCAT TCAGGCCCCT mCCTCTCC 300 
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 

65 GAAACACACT AAAGTGG ACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
CCCTCTCTAT GAGCTGAAAG ACTG ACCAAC CATGAACACT AGTAGGGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 

70 TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA G AGAGAGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGG AAG CACAAGCTGA TCAAG ATGTG TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900 
ATTTTACGAG CATTGAAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 

75 ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTG ACA TCACTGTTTT GCGCCATCTA G AGAAG ATGG GATGTCGGTT AATGAACCGA 1 140 
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CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200 
GGTGTTOCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTO AAGTTCTGGA GTTCCCAATG OTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTO ATCTAAGCCA TCTTATTCGC 1380 
CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAG AGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCO TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AOATGGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAO ATG ACGGCTC CTTCTG CGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1 740 
CTTCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1800 
AGTGAGACTA GTG AGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT I860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 
GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 
ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATQTGTAT GCTCATAGGC CATGAGGAAC 2280 
AAATACTTTT TTTTTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 
GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGO AAGATTTOfC ATCAGTGAOG 2400 
AAAACATCTG CTTAAATTAC AGG AATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATG AA AT CTTGAGGGGC TCI 1 1 TACTG 2520 
OO ATTTCTT A 11 1 HUG 11 TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 
CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATG ATG AAAT GGAAATGAAC 2640 
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 
TTTAAAATAT GTGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAG ATGTGC 2760 
TCTGTTAATG TTGCTTTTTT TTTTTTTTTI TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTAACAAO 2880 
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
TTGGGCATTT AAATAAGGAC AAGG AAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000 
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060 
TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120 
TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 
GTT T TTO ACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GOGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG OATTTGGGAT 3420 
TGGGGGTGGG TTGG ATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAG AATTTAT 3540 
AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATO CCAT T 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAG ATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTG A TCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGG AAT GTAATAAAAT TTGTTTTTTC 3900 
CATAG AATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATG AA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAG AAAA GOATACATAA GAGGAGTTTG 4020 
TAATTTATCT TAGG ATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATG ACCAT ACTGTGG AGO ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT G AATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAA CTTAT 4380 
ACCAAATTAA CCAACTATAT TATAGG AAAT ATGTG AAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAG AGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAG AT TGAGGTAG AG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTG AA TTGTAAAATT T TTGTA TATT 4740 
GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATG ATTA TTTCTGATAT TGTTTTTATG 4800 
TCACCCATGA TGAAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TAAAATTTTT AAT AG 1 1111 TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAG AA GAAAGCAATG 5100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG G GCTTGTGG G GCTTAGAGAC 5160 
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT G AG AAGOTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
GTAAOCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAG ATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG G ATTTATAAC CAG 

Scq ID NO: 72 Protein sequence: 
Protein Accession #: BAA86552 



1 II 21 31 41 51 
I I I I I I 
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MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEW LTIEQGNLGL 60 
RING E LIT AY PQVV WRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMWKNT RGHRGKAVFL ARDKHHLADL 180 
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
5 LSEQGKQLAI QVSN1LGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAG1IA 300 
DYAASLLPSG RLTRRMSLLS WSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 
10 Nucleic Acid Accession #: XMJJ40080.2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 5 CTGAGTGGGG GCGGGGACTG CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120 
GTGAGGGAGT GACAGCAGCG CATTCGCGGG ACX3AGAGCGATGAGTGAGAA CGCCGCACCA 180 
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 

20 TTCGACAQAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACQTTGCCAT GGATTTCGGA 360 
GGTCATGGGC TCTCX3TCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420 
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACXrr TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATG AAATGGA GAACTTGCTG 600 

25 ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG. AGGCCTCCCA GGAGCCCTCG 660 
CACGTGTTCA GCCTG AAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
GAGG AGTGCG GGGAGCTTCT CCTGCAAAGA GG AACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 

3 0 GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGG AGT CCCTGTCGTT CATGATAGAC 960 
ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
QTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1140 
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGG AC 1200 

3 5 AGGCCTCACT AGTCTTG AGG CCCAGCCTAG G ATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAG AGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

40 Seq ID NO: 74 Protein sequence: 

Protein Accession »: XP_040080. 1 

I 11 21 31 41 51 

45 Uenaapguselki^vywgm^^ 60 
yvamdfgghq lsshyspgvp yylqtfvsei rrwaalkwn rfsilghsfg gwggmffct 120 
fpemvdklil ldtplflles demenlltyk rraiehvlqv easqepshvf slkqllqrll 180 
ksnshlseec gelllqrgtt kvatglvlnr dqrlawaens idf1srelca hsirklqahv 240 
lukavhgyf dsrqnyseke slsfmidtmk. stlkeqfqfv evpgnhcvhm sepqhvasii 300 

50 ssflqcthml paql 

Seq ID NO: 75 DNA sequence 
Nucleic Acid Accession #: NM_005794 
^ j Coding sequence: 434-1 276 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

GGTTCOCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 

60 CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GG AATGAACA ACTCTGGACA 1 80 
CACCATCTTT AAG AACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 
GTG AGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA G ACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAG ACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 

65 GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTG AG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 

70 GGCTG AGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGG AC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 

75 TAGAACACTG GCATTGG AGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT G AGTCTCTCT GGAAGAACTT 1 140 
CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
CC 

Seq ID NO: 76 Protein sequence; 
Protein Accession #: NP 005785 

I 11 21 31 41 SI 
1 I I I I I 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHVV1SSRKQ QNVDRAMAKL QGEGLSVAOI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPW 180 
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 



Seq ID NO: 77 DNA sequence 

Nucleic Acid Accession NM_002425 

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I 1 I 

AAAOAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGOC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGOATCT 120 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAO A TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAG AGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATG A TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GG ACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGG AO A 600 
TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTOCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTOCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
GTCCTTCG AT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAG A GTAACAGCTG 1440 
GTTACATTGC T^GGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 
ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAG AAG ATG AGCCTTGCAG ATATCTGCAT GTGTCATG AA GAATGTTTCT GGAATTCTTC 1 620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAG A 1680 
ATGTATTTTC ATAG ATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1 740 
CTT 

Seq ID NO: 78 Protein sequence: 
Protein Accession #: NPJW24 1 6 

I tl 21 31 41 51 

i i i i i i 

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNUVK 60 
KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC G VPDVGHFSS FPGMPK.WRKT HLTYRIVNYT 1 20 
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 
FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAJSTL 300 
RGEYLFFKDR YFWRRSHWNP EPEFHUSAF WPSLPSYLDA AYEVNSRDTV FTFKGNEFWA 360 
IRGNEVQAG Y PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRUADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHUXSN SWLHC 

Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession*: NM 000493.1 

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTQA ACTTCGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
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CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT 180 
ACTCCTGGTC CACCAGGCCC TGCTGGACCT CG AGGGCACC CAGGTCCTTC TGG ACCAGCA 240 
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
5 OGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA GAAAGGGG AA ATGGGATATG GTGCTCCTGG TCGTOCAGGT 600 
GAG AGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTOC TGG AGTGGG A 660 

1 0 AAAAG AGGTG AAAATGGGGT TCCAGG ACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGG A GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 

1 5 GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGG ACCCCCT 1020 
GGGAATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GG ATACCCTG GGGCTAAGGG TGAAAGGGGT 1 140 
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 

20 CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGG AAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 

25 ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA TTTTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGG ACTGG AATCTTTACT TGTCAGATAC CAGG AATATA CTATTTTTCA 1 800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1860 

3 0 GTAATGTACA CCTATGATGA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCOCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG GATTCCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100 
COCCACCCTA CAAAATGCAT ATGGAGGTAG GCTG AAAAG A ATGTAATTTT TATTTTCTGA 2160 

3 5 AATACAG ATT TG AGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTGAG CAGCAACGTA 2220 
AAAACGTATG TG AAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAA AG A AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAGA AATAG AGCTC T AAGTTATGT 2400 
G AAATTTG AT TTG AG AAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 

40 TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAAT AT CT ATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAA AGCTC TTATATAAAA AAGCCCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT G AACCTTTTC AAAACTTTTC TATGATTGCA GAGAAGCTTT TTATATACCC 2760 

45 AGCATAACTT GGAAACAGGT ATCTGACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATTTTCTGG A TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CT GAGGG TTT 2940 
CAAAATTCGA CTAG AAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATTTTTATA 3000 
TTGCTGTTTA AAACTTTT AA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 

50 TCCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAATAA 3180 
CATCAATAGA TTTTTAGG 

Seg ID NO: 80 Protein sequence: 
55 Protein Accession ft NPJ>00484.I 



I 11 21 31 41 51 

60 MLPQIPFLIX VSLNLVHG VF YAERYQTPTG DCGPLPNTKT QFFIPYTKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 
' VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGKGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

65 PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPG AKGERG SPGSDGKPGY PGKPGLDGPK. GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFKAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 

70 YHVHVKGTHV W VGLYKNGTP VMYTY DEYTK G YLDQASGSA IEDLTENDQV V/LQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 
Nucleic Acid Accession U: NM_000786 
75 Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
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CGCGATTCTC AGGG ATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 1 20 
ACAGAGTGTT ATTTAAGGGC GTGGCCAGCG GAACATCCCG CCCCATTCTG TGACGCACGG 180 
GGTGGCGCGC GTGGG ACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTOCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCO GCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGO TCTACCTG AT 480 
CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG OTGAAAAGTC CTCCATACAT 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAG AAAAT GCATATG AGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT G ACAACACCT GTGTTTGGGA AGGGAGTTGC 780 
ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA OTGQCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTGA 900 
GAGTTGGGGA G AAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAG AAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAG GTTG 1080 
GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGOATATTTT 1 140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATOACA TTCTCCAAAC 1200 
TTTACTAGAT GCT ACATACA AGGATGGGCG TCCTTTG ACT G ATG ATGAAG TAGCAGGG AT 1 260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAG AAC AGAAAACAGT 1380 
CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GG ACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 
TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAG A CAATTTGGTC 1740 
CACTATGCTT CGTTTATATG AATTTG ATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1 800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACG AA GATCAAAAT^ I860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATGAAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 
AAAGOCAGTT TATGATTTAG G ATTTTGTTA ACTGAATGGT TCTATCAA AT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAO TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGG A TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAG AGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAOTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCGA 2340 
ACATCTAAAG CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGO AAAAAG A AATCCTTTGO AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAG TTCTA CAAGGTCACA TTTATOTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC 11 1111 1111 II II 1 1 1 11 1 2700 
TTGAOATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGGTT 2880 
TCACTATGTT GGCTAGGCTG GTCTTGAACT CCTGAGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TG G CCTTCCC TG CTTCCTCT 3000 
CTAGAATCCA ATTAGGG ATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
rrrTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 
TCTCAAATTG TAACTCGGGG CCTAACTATA TG AG ATG GCT GAAAAAATAC CACATCGTCT 3240 
GTTTTCACTA GGTG ATOCCA AAATATTTTG CTTTATGTAT ATTACAOTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAGAA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTG AAAAAA T 

Seq ID NO: 82 Protein sequence: 
Protein Accession U: NP 000777 

1 11 21 31 41 51 
11)111 

MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPY1 FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 120 
ALLFNSKNED LNAEDVYSRL TTPVFGfCGVA YDVPNPVFLE QKKMLKSOLN IAHFKQHVSI 180 
ffiJCETKEYFE SWGESGEKNV FEALSEUIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 . 
SHAAWLLPGW LPLPSFRRRD RAHREIKDIF YKAIQKRKQS QEKIDDILQT LLDATYKDGR 300 
PLTDDEVAGM UGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCDCETLRLR PPIM1MMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFG A GRHRCIGEKF AYVQHCTTWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP VIRYKRRSK 

Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #: NM_006551.2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop codoos) 
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1 11 21 31 41 51 

AATTCTAG AA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
ACCATGAAGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120 
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGA AGCCT CATTGCGGAA 300 
GTCCTGGTGA AAATATTG AA GAAATGTAGT GTGTCACATG TAAAAACTTT CATOCTGGTT 360 
TCCACTGTCT TTCAATG ACA CCCTGATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO: 84 Pro tern sequence: 
Protein Accession #: NP_006542.1 

1 11 21 31 41 51 

MKLSVCLLLV TLALCCV'QAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSL1AEV LVKILKKCSV 

Seq ID NO. 85 DNA sequence 

Nucleic Acid Accession*: NM 002317.1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codoos) 
1 11 21 31 41 51 

GGGCCAGGAC TG AGAAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TG AATCCCAC CCTTGGCATT GCTTGGTGGA GACTGAGATA CCCGTGCTCC 120 
GCTCGCCTCC TTGGTTGAAG ATTTCTCCTT CCCTCACGTG ATTTGAGCCC CGTTTTTATT 1 80 
TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATGCGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGG AGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
CCCAGCAGCC CCGCACTCCG ATCCTGCTG A TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCOCTCG CGCGCGGAGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACGAC CCTTACAACC CCTACAAGTA CTCTG ACG AC AACCCTTATT 780 
ACAACTACTA CGATACTTAT G AAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 
ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATGAG TTTAGCCACT 1 140 
TGTACCTGCT TG ATGCCAAC ACCCAGAGGA G ATGGGCTG A AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CAC AGOG ATT G AGTCCTGGC TGTTATG ATA CCTATGGTGC AG ACATAG AC TGCCAGTGGA 1320 
TTGATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
ACCTGGTTCC TG AATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TJAfiAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAA AT AG ACT AACTTCAGTA 1560 
GOATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Protein sequence; 
Protein Accession if; NP_002308. 1 

1 II 21 31 41 51 

MRf AWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPGAAVPGAA NASAQQPRTP ILURDNRTA AGRTRTAGSS G VTAGRPRPT 1 20 
ARHWFQAG YS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 1 80 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession ft: NMJD06419.1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

TTCGGCACTT GGGAGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
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GCTCAAGTCT GAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 180 
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAG ACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG GAATGGTTGT CCAAGAA AAO AAATCATAGT CTGG AAGAAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTG AO AAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAG AGAAA GATTCCCTSA 420 
TGCTG ATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGG AGC TG ATGGGGCA AACTCAAGCT TCTTCACTCA 600 
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGG AAT GAGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAG A TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
TAGATATCCT CTGCTTAAAA ACTCACTACG G AGG AG AATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCG A AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGOCTT 1080 
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTQGGGGC GGGGCCGGGG 1140 
GGACTCTGGT ATCTAATTCT TTAATG ATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATT TTACTT 

Seq ID NO. 88 Protein sequence: 
Protein Accession #: NPJJ06410. 1 

1 tl 21 31 41 51 

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 
PRKEUVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

Scq ID NO: 89 DNA sequence 

Nucleic Acid Accession #: NM 002652 

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120 
CAGGACAACA CTCGG AAG AT CATAATA AAG AATTTTG ACA TTCCCA AGTC AGTACGTCCA 1 80 
AATGACGAAG TCACTGCAGT GCTTGCAGTT CAAACAGAAT TGAAAGAATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAA ACCTTC TACTGGG ACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTGAAA TCCTAAAGGT AGAATAATGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTG A TTTCCTCTAA AG AAACTTGG CTGGAATTTC 540 
TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 

Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP J)02643. 1 

1 11 21 31 41 51 

MRLLQLLFRA SPATLLLVLC LQLG ANKAQD NTRKIUJCNF DIPKSVRPND EVTAVLAVQT 60 
ELKECMWKT YUSSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAWDVTREL 120 
GICPDDAAVI PIKNNRFYTI EILKV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession U: AK00034 1 

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

G ATAGCGCCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CATCGCCCCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG G ATCATGG AA CATCTAAAGG CCTTTG ATG A TG AAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCG A G ATTCTCG AG TCAGAGGGTG GTTCACGTTG 1 80 
GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 
AACAAGTATA TG AAG AACAG ACCTGCTCTT TCTCTC AGGG GTATCCTCAC CTTGTATAAT 300 
CTTGG AATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GG AGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTG A CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 
TTCGTTTTGC GGAAAAAAAC G AGTCAG ATT ACTTTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTG AAC TGGATACCTT GTGGACAAAG TTTCTTTGGA 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGG ACT TTCTGTQTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCOTGA AACCGTGTOG CTTO CCCTT C 780 
GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 
TATGTTCAG A CATACCG AAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
A AAG AAG TG A AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTGATGAAC 960 
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AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTG GC 1020 
TTGTTTTAAA GCAAAG ACTO AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1140 
AACTGAATGC TTTGATCAGC ATTG AGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
5 CAGCTTCTCT CTCCCTCCCT CCCACAGACT GAACCTTTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 

1 0 CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAGATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1740 
GTTATACAG A TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 

1 5 TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT GAACATGCAT GCTGCTGGAA 1860 

TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTG A GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTG AAGCTGG GAGGAAAAAG ATGGAGTAGC TTGG AAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 

20 CAGCACTOCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2160 
AG ACCAAAGT ATTAATGAGT ATTTCCTTTC TCCATAAGTG C AGGACTGT T ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

25 Sag ID NO: 92 Protein sequence: 

Protein Accession #: BAA9 1 096. 1 

1 11 21 31 41 51 

30 MEHLKAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLSIW LGNKYMKNRP 60 
ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWYY 120 
FSKSVEFLDT IFFVLRKK.TS Q1TFLHVYHH ASMFNIWWCV LMWIPCGQSF FGPTLNSFVH 180 
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAWKPCGF PFGCLIFQSS 240 
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 
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SeqIDNO:93 DNA sequence 

Nucleic Acid Accession »: NM 000044 

Coding sequence: 1 1 15-3874 (underlined sequences correspond to start and stop codons) 



40 I 11 21 31 41 51 

CGAGATCCCG GGGAGCCAGC TTGCTGGO AG AGCGGOACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGG AA AAAGGGCXGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGC C TCTT GCAGCG 180 

45 CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTTTTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAG AG GTAACTCCCT 480 

50 TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGG AGAG A 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCGAGTGCGG 660 
AGCAGAGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAA ACAAAC 720 
AAAAACAAAA AAGCCGAAAT AAAAGAAAAA GATAATAACT CAGTTOTAT TTGCACCTAC 780 

55 TTCAGTGGAC ACTGAATTTG GAAGGTGGAG GATTTTGTTT TTTTCTTTTA AG ATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGOGCAOATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTG AGGC TGTCAG AGCG CTTTTIGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 

60 GGGAAGTAGG TGGAAGATTC AGCCAAGCTC AAGGATGGAA GTGCAGTTAG GGCTGGGAAG 1 140 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GOTCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 

65 GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TOAGG AACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGGAC GAGGATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 

70 GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTG^TGT I860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 

75 AGGTTCTCTG CTAGACGACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC I 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC : 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 

387 



WO 02/059377 



AGCACTGGAC GAGGCAGCTG CGTACCAG AG TCGCG ACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCG AGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
TTCCTCATCC TGGCACACTC TCTTCACAGC CG AAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCQGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTCO TACCCTGGCO GCATGGTGAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAA AAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GG AGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTTCCACCCC AG AAGACCTG CXTGATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAG AAA TG ATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCOTC TTCGG AAATG TTATGAAGCA GGG ATGACTC TGGG AGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGG ATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAOGATGCTC TACTTCGCCC CTOATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCO GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCG A TCGTATCATT GCATGCAA AA GAAAAAATCC 3660 
CACATCCTGC TCAAG ACGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCGAGAGAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGOTGAGCGT 3780 
GGACTTTCCG G AAATG ATG G CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTCCCCACC 3900 
CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTG ATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT OTGTTTTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCACGGGAA 4260 
GTTTAGAG AG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Seq ID NO: 94 Protein sequence: 
Protein Accession ft NPJJ00035. 1 

1 11 21 31 41 SI 
I I I I I I 

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV RBVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQQPSQPQSA 120 
LECHPERGCV PEPG AAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLIC 1 80 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPH PHARUCLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGOGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAECK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLOARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660 
VSH1EG YECQ PIFLNVLEAI EPG WCAGHD NNQPDSFAAL LSSLNELGER QLVHWKWAK 720 
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYHCELDR 840 
IIACKRKNPT SCSRRFYQLT KLLDSVQPLA RELHQFTFDL UKSHMVSVD FPEMMAEHS 900 
VQVPKILSGK VKPIYFHTQ 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession ft NMJJ02497 

Coding sequence: 135-1472 (undcriincd sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GOTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCG ACTCTG 120 
GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 
CAGGCTCCTA CGGCCGCTGC CAOAAG ATCC GGAGGAAGAG TGATGGCAAO ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCCATG ACAG AAGCTG AGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TT ACT ATG AT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGG AACC AAGGAAAGGC AATACTTAGA TGAAG AGTTT GTTCTTCGAO 480 
TGATGACTCA GTTGACTCTG GCCCTGAAGG A ATG CC ACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGG AGA CTTTGGGCTA GCTAG AATAT TAAACCATG A CACGAGTTTT GCAAAAACAT 660 
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TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA Q AACTCGCTG GGAAAATCAG AGAAGGCAA A TTCAGGCG AA 840 
TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
5 ATTACCATCG ACCTTCTGTT G AAGAAATTC TTG AGAACCC TTTAATAGCA G ATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAG AG CCAGAAAAAT 1020 
CGCAGG ATTC CAGCCCTGTA TTGAGTGAGC TGAAACTG AA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1 140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 

1 0 GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATG A 1320 
GGAGTG AGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCTAGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

1 5 AGGATGTAAT ATTACCAACC TTTAAAGACT QATATTCAAA TGCTGTAGTG TTGA ATACTT 1560 
GGCCCCATGA GCCATGOCTT TCTGTATAGT ACACATGATA T TTCGG AAT T GGT TTTACTG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACT AGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

20 TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAG AA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 

25 TAAAAGAATT CTTCAGTTA 

Seq ID NO. 96 Protein sequence; 
Protein Accession #: NP 00248S 

30 1 11 21 31 41 51 

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60 
DYEVLYTIGT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120 
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 180 

35 CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAP SQKELAG KJREGKFRRJ PYRYSDELNE 300 
HTRMLNLKD YHRPSVEEIL ENPUADLVA DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCKDLKKR LHAAQLRAQA 480 

40 LSDDEKNYQL KSRQILGMR 

Seq ID NO. 97 DNA sequence 

Nucleic Acid Accession #: NMJM7050.2 

Coding sequence: 1 85-4576 (underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGGAGCGCCG GCTCCGGGAG 60 
CCCGAGCGGA GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCGATGCG GGGCCACCCC 120 

50 GCGaXBCCCCAGTCCCGGCCCCGGCCCCCG 180 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AG AGCGCCCC AGGTGGCTGT TCCTTTGATG AG C ACT AC AG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGGAGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420 

55 CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTGA AGGAGAATGA 480 
CACCCACTGC ATCG ACTTCC ATTACTACTT CTCCAGCCGT G ACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TG AATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720 

60 GG ACGAGGTC CGGGTCCTTG CTCATCCATG CAG AAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGGAAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTG AT GGTGGGTCTG GTGTGTCCAA 1020 

65 CTACGCGGAG CTGATCGTGA AAGAGCCTCC CACGCCCATT GCTCCCCCAG AGCTGCTGGC 1080 
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TCCATCATCG GGG ATGGCCC 1 140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCG ACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 

70 CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTG ACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGG AGC TGGTGGTGCA 1620 

75 GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CG AGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800 
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GGOOAAAGTQ TTCAAGCTCC GGAATG AAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1 860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTCTCAC 1 920 
CACTCGG ATT GCCACCAAAA TTTCAGCTCC ATCCATGOCT G AGTACGACA GAGACACCCC 1980 
ATTGAATGAG ACAGACACG A CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGOGA CTTCAGAAGTCACGOAGGGC 2100 
AGCTGACATT ATTG AGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCOA 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTG AA GCCTGCCAAC CTGCCTGTCA COCAGCCATT 2220 
TACAGTGGGT G ACAATAAGA CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTG AA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GG AGAGACCA AAATCAACTG 2340 
TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAG AAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTG ATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 
GATGGGGCCT GTGGCCTCTG CCG ACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGG ACGT CAACGGATTC ACAGATGGCA GCCGCGGGG A 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGGACC AGTTCCA ACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAQ ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 
AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGG AGG AT GAAAACCGCA ATAAG AATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATG GAG A 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTG ACGGA TACCATCGAC CTCGGCACTA 3 120 
CATTGCGACT CAAGGTCCG A TGCAGG AG AC TGTAAAGG AC TTTTGG AG AA TG ATCTGGCA 3 180 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 
TGTGCGATAC TGGCCAGATG ACACGG AGGT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 
TGACACCATG CTTGACATGG CCGAGAATG A AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 
GCTCCTGCCC CGGAACCATG ATAAG AATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGG AGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
G ATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 
GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCGACG AO GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TG ACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 
GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTG AA TGGCAGTAAC TGGGCTCAGO AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAG AAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 
GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAG ATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 
CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACG ATG TTCCCTAGGC CACAAGAG AG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC GTCCAGCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTGATAAA GGTGTTCTTT GCT1T1T1GT 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 
AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA OCCCTGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAAC AAG ATA AGATTTG ATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGG AG GGTGTAAAAT CTTAGTCTCT TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 
AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTG AACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGG AGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGG AACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GGATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 
CCTTAGAACC CCCAGG AG AG AGCTACTG AC TTGTTATCAT CTCCATTTGA TCATCTCCTC 6000 
CAATGAGG AA ACCCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAG ATAA CTCCACAAAT GGAAATTTAG 6180 
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 
CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGG ACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCC AATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 
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TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AG AATTCAAG AAGGTACAGC 6480 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTGTGGA 6540 
GTGGATGAGT GTTCCATG AG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG GGCTTCCTGG 6600 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
5 CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATGAG 6780 
CCAGGGTCCA GGAG ACAGAT GCCCATCGCO AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGG A GCTTTAGAGA G ATGCACTCA GCTTCGTGTT GGCCAAG ACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 

1 0 CTTTTTA ATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 , 
GCAATGGATC ATGAGACTCA GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGG AG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGG ATGG A ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAG ACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 

15 AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTG ATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 

20 AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGG AGAAAC 7620 
AAACTTGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTGAT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATOAC 7800 
TTCCCTAAGA TCCCATGGGA CAAGG ATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 

25 CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TG AGCAG AGO 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CC ACCT GTGT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8 160' 

3 0 GGGTGAAGAA TCTGGGCCAA CC AGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGGAGG GTTTGTG ATT GTCACCATTT ACTG ATGAGG AAACTAAGGC 8400 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 

35 TAAACCGGTT TTTTCTCACT ACAGAG ACAA TATTTTTCCA CCATTGTATC TCACATTTTT 8520 

CCCAGGAGGT TACCCATAAC AGAAG AGACT AGAGTGGAAC AGATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 

40 TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
TGACCCTGCT GCCTCCCAG A GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA G AAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGA AGC 9060 

45 TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAG CAGCT 91 80 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGGAGGGGGC ACT ATT AG G A AATCGAGATT ACTATGGCAC AATTAATTCC 9300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 

50 ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGG AAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AG AACCGAGG ATGCAGATG A CTTATTTAGC AAAATGGTCA CTTCTGTG AC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAG AGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GC AAA AGATG ATGC ATTTCC CTCTG AOCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 

5 5 CCACACTAGC TCTTGCTGTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGGA 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT GCAGG AGTGA CTGCCCAACC 9840 
CAGATCCAG A GACATTGAAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAGAAAG CCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 

60 CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTA A AACAAGGAAA TAGGAAAAGT 10260 

65 AAATCCAACT AG AAGAGACA AGCTG AG ATT CAGATTTGTT TACTCCTCCC ATGCAAAGTT 1 0320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCG AAGC ACAGCAGAG A TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 

70 ACTCTTCTAG TTTTTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCACTGGGGC AG ATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 

7 5 CTATTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TC CTTCT ATA TCTGAGAACT 10920 
GCTCTGGGTG GTGAG ATGTG G ACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 1 1040 
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TTAATGTAGT TCATGGTTCC TTAGTOAGCA GGACTCTTGG ATGTGGAGGA GAAAGOGTCA 11100 
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 1 1160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 1 1220 
_ CACCCTOTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 1 1280 
5 OCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAOTTAOGC 1 1340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 1 1400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGG AAGAACC TGCTGTCTGC CAGAG AGGAT AAGCTGCCCA 1 1 520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 1 1580 

1 0 AATGGATTTC AGGTCACTTC CATTCCAG AT GGGTG AGATC TTGTGOAGCT GGG ATCATGT 1 1 640 
TTGAACTCAT TCATACCTGT AGAGCACG AA TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 1 1700 
CTOAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 1 1760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAOGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 1 1880 

1 5 CCCACTGACC AACAGGGCCA TGAGTGACAO CCAGAACCAG TGAAGGTCCA AGTAGGACAC 1 1940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT T AACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 

20 CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAQ ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 12480 

2 5 TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAA ATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTG ACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

30 Scq ID NO: 98 Protein sequence: 

Protein Accession #: NP_008981. 1 

1 11 21 31 41 51 

35 GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 

LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCG Y SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPG ALNVY 180 
VKVNGGPQGN PVWNVSGWT EGWVKAELAI STFWPHFYQV ZFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFOC IAGGKWSQHD ICLWLQQWNGR DTALMVTRW 300 

40 NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSG VSNYAEL I VKEPPTPIA PPELLAVGAT 360 
YLWDCPNANS IIGDGPULK EVEYRTTTGT WAETHIVDSP NYKJLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEEV1QT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELWQTEED 540 
VPGAVPLE5I QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 

45 KLRNETHHLF VGLYPGTTYS FTIKASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTTTVMUCP AQSRGAPVSV YQLWKEERL QKSRRAADII ECFSVPVS YR NASSLDSLHY 720 
FAAELKPANL P VTQPFT VGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAGVIAGL LMFH1LLGV 840 
MLTDCRRRNA YSYSYYLSQR KLAKKQKETQ SG AQREMGPV ASADKPTTKL SASRNDEGFS 900 

50 SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLLQHTTQM 960 

KRGQG YGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEG WD IFNCVRELRA 1200 

55 QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1260 
LNIVTPRVRP EDCS1GLLPR NHDKKRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QPAAFWTQH PLPNTVADFW RLVFDYNCSS WMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED IIHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKWRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NI1DVFH1VK TLRMNKSNMV 1500 

00 ETLEQYKFVY EVALEYLSSF 

Scq ID NO: 99 DNA sequence 
Nucleic Acid Accession U: NMJW2988.1 
- Coding sequence: 71-340 (underlined sequences correspond to start and stop codons) 

65 

I 11 21 31 41 51 
I I I I I I 

CCGGCACGAG AGGAGTTGTG AGTTTCC AAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC AJGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 

70 CTGCTCCTGT GCACAAGTTG GTACCAACAA AG AGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAG ATTCCA CAAAAGTTCA TAGTTG ACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAG ATCTGT GCTGACCCCA AT AAG AAGTG 300 
GGTCCAG AAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 

75 CCACCCTGG A GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATG ACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TC AA ATG ACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
^ AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence: 
Protein Accession #: NPJJ02979. 1 

H ^ 1 II 21 31 41 51 

10 | | | | j i 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 1 01 DNA sequence 
15 Nucleic Acid Accession* NMJH 5507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

20 CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 1 20 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAG AGAGGAG GCGGCGGCTT AGCTGCTACG 1 80 
GGGTCCGGCC GGCGCCCTCC CG AGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAG A 240 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

2 5 GGG AACGCGG CCAGTGCAAG GC ATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AG ATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGG AATG 540 
_ _ AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 
30 CTCAGTGGOC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAG ACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGG AAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

3 5 AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
40 ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCG AG G AGATGTOTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTQGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGG AAACAGG ATAGAG AAGA TG ATTTTGAC 1 500 
45 TGGAATCCTG CTG ATCG AGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1 560 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
5 0 GAAGCAGAAC GTGGCA AGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1 860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACIGAATGTTACT ATCTTTATAT 1920 
TTG ACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1 980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAG AA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 
5 5 TTTCTG AATC TTTCCACATT ATATTATAAA ATATGG AAAT GTCAGTTTAT CTCCCCTCCT 2 1 60 
CAGTATATCT GATTTGTATA AGTAAGTTGA TG AGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAGAAAAAAA AGCACAGAG A AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAG ACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



60 



Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_056322.2 



1 11 21 31 41 51 

65 | I | | | | 

MPLPWSLALP LLLSWVAGGF GNAAS ARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPG YTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMJLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GK.VTCPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

70 GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGmDRI KKLLAHKNSM KKKAKJKNVT 300 
PEPTRTPTPK. VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LIXPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 

_ _ FVKNSNNALA WEKTTSEDEK WKTGfCIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 

75 SGLCPDSLLSVDD 

Seq ID NO: 103 DNA sequence 
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Nucleic Acid Accession #: NMjOOl 565. 1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 
I I I I I I 

GAGACATTCC TCAATTGCTT AG ACATATTC TG AGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTOAC TCTAAGTGGC 120 
ATTCAAGGAG TACCTCTCTC TAG AACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 
CCTGTTAATC CAAGGTCTTT AG AAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTG A GGTGCTATGT TCTTAGTGGA TGTTCTG ACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGG A ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTOAATGO 1080 
TTTTCAAATA AAAATG AGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO: 104 Protein sequence: 
Protein Accession #: NP_001556. l 

1 11 21 31 41 51 
! I I I I I 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLE! IPASQFCPRV 60 
EIIATMKiGCG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 1 
Nucleic Acid Accession #: NM_0!5068. 1 

Coding sequence: 1 1 70-2243 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 
I I I I I I 

GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCOCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA G AAGGGACG A GCTCTCTG AA GAG ATCAACA ACTTAAGAG A GAAGGTCATG 1 80 
AAGCAGTCGG AGG AG AACAA C AACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAG AAC 240 
ACCACCCTTC GAG AGCAAGT GGAACCCACC CCTGAGGATG AGGATGATG A CATCG AGCTC 300 
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCG ATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAG AGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TG ACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 
CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCG ACTACT CCAATGCTTT CCAGATGATT GCCCAGG ACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTG ACC AGTACCACG A GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTG ATT GGGCAGTGCA TTCACATTG A GAG AAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AG ATCCAACC GAGCCOGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1140 
ATGATGCCTC ATCTCCACAC TTG CAAGTG A TG CTCCAGAT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TOGGGCCCAG TTGTCCACGA AACTCACG AC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTG AGCAC AC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TG ATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TGGCAAOAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAGATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTG A CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAGACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTG ACTGCCA ATCTAACTAA AATAATTAAT 2520 
AAGTTTACTT TCCAGCCAGT CCTGG AAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGCCA AATTTGCTOT TTAACATTTA CAOAGAAGCT GATA CAAAC G 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG G AGGAGG ACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTG ATCCTCT CGCTGCACCC TTAGAAACTT 2820 
CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 
GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTG ACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
TTCATGAAAT GTG ACCACCT CTCAATCACC TTTGAGGGCT AAAG AGTAGC ACATCAAAAG 3 1 80 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATG AGTGT TCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGGAGAGAG GAG AAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 
GAAGGGGGAC ATCTTTG AAG AAGAGG AATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAG AATCCAC 3540 
GTAATGAAAA AGGTCCCTCT AGAGGATGG A GCTGATGTGA AGCTGCCAAT GGATG AAAAG 3600 
, CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC AC AAG AG TTG TCTTCAGCCC 3660 
AOTCACACCT CTOATGTCCC CTGGAAGCTT TGTGC TAACC TGGGACTGCC TGACTTCCTT 3720 
TAGCCTGGTC CCTTGCTACT ACCTTG AACT GTTTTATCTA AC CTCT CTTT TTCTGTTTAA 3780 
TTCTTTGCTA CTGCCATTG A CCCTGCTGCA GGATTTGTGT CATTTTCCTG CCTGGTTGCT 3840 
G AGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 
CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 
TCTTATTCTT GCATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCA CTATC CA GTTCTTATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TGAATGAGAG GCTCCTCAGA ACTGGAGCAT TTGTATA AT A ATTCATCCTG 4260 
TTCATCTTCA ATTTTAACAT CAT AT AT A AT TTCAATTCTA TCA ATTGGGC CTTT AAAAAT 4320 
CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTT AA TCC AAAACAA 4380 
C T n m irT TCCTTCAATG GAATCAG AAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTA AAATGGTGCA TTTGTGCTTC TO AACTATTT TGA AGAGTC A CTTCTQTTTA 4500 
CCTCAAGTAT CAATTCATCC TC CAT AC ATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 
AGTTGTCAAT TG ATCTTCAA GCTGCAGGGT GCCTAGAAAT GGGCCGTTGT CTGTAGCCCT 4620 
GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAG AA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC 4740 
CACATTTGGC TGTTTACTAA AGCTTGTG AT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGG ATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TG AAGTTTGG TTAAGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 
TCAAATTTAT GTTAATTGG A TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 
TTATTCATAA GAAAGG AAAA AAATCAATGG AATTTGATAT CTAAAGAAOT TAOAAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGGAAGA AG ATCTAAG A GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 
TTATCATGCT AGGATATTG A AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAG ATACAG ACAGTTCCAA G ATTGACAAC AGAGAACAAC 5520 
CTGCTGG AAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG AAOCATGCTA CATTTTCATC 5580 
TAAAGAG AGA ATGCACATCT GATG AG ACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTG AAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGT TTA 57 00 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT A AGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAG AA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
TAAACTCGG A GGGAACTGCA GGG AG ACCAA CTTATTTAGA GCGAATTGGA CATGG ATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCTCTGATAA ATTACTGCTA GAATGAACTT GTCAATGATG G ATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 
CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 61 80 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAG ATTT 6240 
AGTGGTTAAGTGC 

ScqIDNO: 106 Protein sequence: 
Protein Accession #: NPJ)55883.1 

I II 21 31 41 51 

MTBRKRDELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLRHQVEP TPEDEDDDIE 60 
LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120 
VTSMMTGRAA RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 180 
VIDYSNAFQM IAQDLDWNEP AUDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCHflERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQ VDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 

Nucleic Acid Accession »: NM JW3679. 1 

Coding sequence: 47-1507(underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCACTTATQQ ACTCATCTGT 60 
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 120 
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAQ ATACTCGAOT 180 
GGCTACCTTC ACACGTGGAA G AAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG A AGATCAGAT TGTATCCCAA GGTATTCCCA TGAG AGCAAG 300 
AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTG AAA TGTAATCCAG AGGAAGGAAT 480 
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AOAACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATG A ACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAG AGTTTGA 780 " 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAG AAAC TCCTAGTGCA AGATTTCTTC CTGTTGOCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 
TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 
AAT AAGATA C CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGOATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500 
CAGGTQATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GG AAG ATAGT GTTCTGCTTA 1620 
T AATT AAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATG AAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1 740 
AACTATAAAA GTGCAATGAC TAAGATCCTT CACTTCTCTO AAAGTAAGGC CCTAGATGCC 1800 
TCAGGGAAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT A TTTTO OCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA G AG ATCAGGT CTCACTCTGT TGACCAGGCT GG AGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCtCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAOA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAG ACCCTCA GAAGCAATTT ACTAATTTAT TCTTCG ACTA 2640 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AG AAATCCTT CAAACATGAT 2760 
TAATTATGAA GATGA AACAC TAG AGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGG AAAGAAA A AATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAG AC AG ATTG ACAA G AG AAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTG ACAAGGA TTCAAAGTTG 3240 
TCTCT GAAAC TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300 
GATTTCTTTT TCATTCTAG A ATTATCTCCT TG ATAACTTG ATCAGATATA GGACATGACA 3360 
CTGAATAGAG TCCAACAGTA CA AAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 
ATTAAAAGAA TTTGTTTTTG TTCAACCTCT TCCTG AGGCC CAAG AGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TG AAAGTCCA A AAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC G AACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGA CAGTG ATTCACCACT GGTG ATGATA AAAATGATGG AAGAAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAG AAAAAAT AAAGTGGAAT 4080 
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ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGG AAT 4260 
ATGTAGGACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
5 GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCC ACAA ATATTCAGTG 4380 
GGCATCTACT AGGTG ACAGC CACTGTGCTA TAATTAGAGA CI 1 IT f ACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AG AATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 

1 0 CATCG AATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

GAACCTGGGA GGCCAACGCC TA AAGATCAT AATATCACAA TGGA AGGAAC CTAGATTCCT 4740 
AAATOACTGC ATAGGACAG A TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATG AC CAGTAAA ACA TAAAT CAAA A GTTAATGTAA TTGTTATCCC 4920 

1 5 ATTATTTAGA GCGAA ATAAA TGTTG AATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 

Seq ID NO: 108 Protein sequence: 
Protein Accession »: NP 003670.1 

20 

1 11 21 31 41 51 

MDSSV1QRKK VAVIGGGLVg' SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120 

25 AAEKYPNVKM HFNHRJXKCN PEEGMITVLG SDKVPKDVTC DUVGCDGAY STVRSHLMKK 180 
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIALPNMN KSFTCTLFMP 240 
FEEFEKLLTS NDWDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300 
LLGDAAHAJV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRIRI PDDHAISDLS 360 
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFTPLYTM VTFSRIRYHE AVQRWHWQKK 420 

30 VINKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNW1 AHFRNTTCFP AKAVDSLEQI 480 
SNLISR 

Seq ID NO: 109 DNA sequence 
Nucleic Acid Accession #: NM JXJ61 1 5. 1 

3 5 Coding sequence: 236- 1 765 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 

40 CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAAACCAT TTTG ATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 1 80 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAAJX5GA 240 
ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CC CTGGC CAT 360 

45 TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGGGAGTG CTG ATGAAGG GACA ACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 
TGGACTTG AT GTGCTCCTTG CCCAGG AGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGG ACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 

5 0 TCTGTACTCA TTTCCAG AGC C AG AAG C AG C TCAGCCCATG ACAAAGAAGC GAAAAGTAG A 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 

55 TACCTGGAAG CTACCCACCT TGGCG AAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1 140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 

60 GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCG AGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT G ATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
• CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 

65 TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

70 ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAG A TGTTG ACTTG AGGAGTTAAT 1980 
GTGATCTTTG GGG AG ATACA TCTTATAGAG TT AGAAATAG AATCTGAATT TCTAAAGGGA 2040 
GATTCTGGCT TGGG AAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTG AA AAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

75 

Seq ID NO: 1 10 Protein sequence: 
Protein Accession #: NP_006 106.1 
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1 II 21 31 41 SI 
I 1 I I i I 

MERRRLWG SI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
FDGRHSQTUC AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFTPVEVLVD 180 
LFLKEOACDE LFSYLIEKVK RKKNVLRLCC KKLKEFAMPM QDIKM1LKMV QLOSIEOLEV 240 
TCTWKLPTLA KFSPYLGQM1 NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP 1LCPCFMPN 



Seq ID NO: 1 U DNA sequence 

Nucleic Acid Accession #: NM 0038 15 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

I U 21 31 41 51 
I t I ) I I 

CGCTGCCATG CGGCTGOCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
OCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGGAC GATCTCCCAA TTAGCCTCAA 180 
AAAGGTGCTT CAGACCAGTC TGCCTG AGCC CCTGAGGATC AAGTTGG AGC TGGACGGTGA 240 
CAGTCATATC CTGGAGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG G ATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAGAGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAG AG ACCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 
ACTAGTOGGC CTGGAGGCCT GGACCCAGCG TG ACCTGGTG GAGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 140 
GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATOGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG 1320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG 1380 
TGCACAGTGT GCATCTGACG GACCCTGTTO TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
GCAGTGTCGT CCTACCAGAG GGG ATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCO GGCAAGCTGT 1560 
GTGCATOCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGOACCTGG 1620 
AGCCCAGCCC G CTGCGCC AC TTTGCCTCC A GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTOTCCTGC ACCCCTAGAG ATCCCATTTG 1 740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TGAGCTGAAC TGCAGCTGGG TCCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGOTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGG ACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 2 1 60 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTG CCGACCCCCA 2340 
QGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CTCTACCTC T GA CCTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGGCOTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AA AAA AAA A A 

Seq ID NO: 1 12 Protein sequence: 
Protein Accession #: NP_003 806.2 

1 II 21 31 41 51 
I I I I I I 

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LOTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRWSE GHTLENCCYQ 120 
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPPHSRIQ DLHLPGHTCA 1 80 
LSWRESVHTQ TPPEHPLGQR HIRRRRDWT ETKTVELVTV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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10 



60 



AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480 
RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRDLLW 600 
ETIDVNGTEL NCSWVHLDLG SDVAQPLLTL PGTAOGPGLV CIDHRCQRVD LLGAQECRSK 660 
CHGHGVCDSK RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLWPS RPAPPPPTVS SLYL 

Seq ID NO: 1 13 DNA sequence 

Nucleic Acid Accession #: NM_00241 6 

Coding sequence: 40-41 7 (underlined sequences correspond to start and stop codons) 



15 1 11 21 31 41 51 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTATG AAGAAAAG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 
AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180 
20 GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAG AAAA TTGAAATCAT TGCTACACTG 240 
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 
AAGTCGG AG A AACAGGTCAG CCAAAAC AAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 
AAGAAAGTTC TGAAAGTTCG A AA ATCTC AA CGTTCTCGTC AAAAGAAGAC TACA3MGAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 
25 TTCCAAAGGA GGATGGCATA TAATACA AAG GCTTA TTAAT TTGACTAGAA AATTTAAAAC 540 
ATTACTCTG A AATTGTAACT AAAGTTAGAA AGTTG ATTTT AAGAATCCAA ACGTTAAGAA 600 
TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 
CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
30 TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
CTACAGGCCT CACACACAAT GTGTCTG AG A GATTCATGCT GATTGTTATT GGGTATCACC 960 
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TA11 TCl ll T TGTTCCCCTT TGCTTCATTC 1080 
35 AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CITCTCTCCA GTGCACCTGT 1 140 
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
40 AGATTGTCAG CTCCTTG AGG GCAAG AGCC A CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 
CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 
GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1680 
45 GCACGTGGTA AAACACTTGC GGATATTCTG G ACTGTTTTT AAAAAATATA CAGTTTACCG 1740 
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 
CCAACCATAC AAAAATTCCT TTTCCCG AAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 
TCTAAGATCT AACAAGATAG CCACCGAG AT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 
AGTTTTATTG TCCGTTTACT TGTTTCAG AG TTTGTATTGT GATTATCAAT TACCACACCA 1980 
50 TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 
CTTTCCCAAA TTG AATCACT GCTCACACTG CTGATG ATTT AGAGTGCTGT CCGGTGGAGA 2220 
TCCCACCCG A ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 
55 AAAAATCTAA GTGTTTCATA AATTTG AG AG TCTGTG ACCC ACTTACCTTG CATCTCACAG 2340 
GTAGACAGTA TATAACTAAC AACCAAAG AC TACATATTGT CACTGACACA CACGTTATAA 2400 
TCATTTATCA TAT ATATACA TACATGCATA CACTCTC A AA GCAAATAATT TTTCACTTCA 2460 
AAACAGTATT GACTTGTATA CCTTGTAATT TG A AATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 



Seq ID NO: 1 14 Protein sequence: 
Protein Accession #: NPJW2407 



1 II 21 31 41 51 

lj MKKSGVLFLL GIILLVLIGV QGTPWRKGR CSCISTNQGT IHLQSLKDUC QFAPSPSCEK. 60 

IEIIATLKNG VQTCLNPDSA DVKELUCKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120 . 
QKKTT 

70 Seq ID NO: 1 1 5 DNA sequence 

Nucleic Acid Accession #: NM JJ0323 8 . 1 

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CAAGCAGGAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGCAAAAG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT GATCTATACT TTG AGAATTG 120 
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TTGATTTCTT 1 1 I'l l I f ATT CTGACTTTTA AAAACAACTT TTTTTTCCAC TTTTTTA AAA 1 80 
AATQCACTAC TQTGTGCTGA GCGCTTTTCT GATCCTGCAT CTGGTCACGG TCGCGCTCAO 240 
CXrrGTCTACC TGCAGCACAC TCGATATGGA CCAOTTCATG CGCAAGAGGA TCGAGGCG AT 300 
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
CGAGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGQ A 420 
GAAGGCGAGC CGG AGGGCGG CCGCCTGOGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGO AGGTT TACAAAATAG ACATGCCGCC CTTCTTCCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCG ATTTG AC GTCTCAGCAA TGGAGAAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG A OTCTTTO OT TTGCAGAACC CAAAAGCCAG 660 
AGTGCCTGAA CAACGGATTG AGCTATATCA G ATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT G AAG AACTAG AAGCAAGATT TGCAGGTATT G ATGGCACCT CCACATATAC 960 
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTG ATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 
CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320 
GGTGTCCCAA G ATTTAGAAC CTCTAACCAT TCTCTACTAC ATTG GCA AAA CACCCAAGAT 1380 
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440 
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACAAGA AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1560 
GGTACTAGTT CAGACACTTT GGAAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTGACAC 1620 
AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO: 1 16 Protein sequence: 
Protein Accession #: NP 003229. 1 

I II 21 31 41 51 
I I I I I I 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKLKL TSPPEDYPEP 60 
EEVPPEVTSI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFVRPYFRIV RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPBQRIELYQ ILKSKDLTSP 180 
TQRY1DSKW KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKJ EQLSNMIVKS CKCS 

ScqlD NO: 117 DNA sequence 

Nucleic Acid Accession U: NM .000095. 1 

Coding sequence: 26-2299 (underlined sequences correspond to start end stop codons) 

1 11 21 31 41 51 
I I I I I I 

CAGCACCC AG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 
GCGCCAGCAG GTCAGGG AG A TCACGTTCCT GAAAAACACG GTGATGGAGT GTG ACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAG ACQ GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 
CGCCCACCCC TGCTTCCCCC G AGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 
AGACGGCTTC CCGG ACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 
GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 
GTCCCAGAAG AACG ACGACC AAAAGGACAC AGACCAGGAC GGCCGGGOCG ATGCGTGCGA 1 140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGG GGAT GCCTGTGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAO 1320 
CGATCAAGAC CAGGATGGAG ACGG AC ATCA GGACTCTCGQ GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACG ACGA 1440 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 
GGAGGACCCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG TGTGTCCGGA G AACGCTGAA GTCACGCTCA CCGACTTCAG 1620 
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT I860 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCOAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
AAACGTGGOT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GOCCCCAAGT 2100 
GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGG AGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TG ACACCATC CCAGAGG ACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GG ATGACAGC CACCCTCACC 2340 
GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

SeqIDNO: 118 Protein sequence: 
Protein Accession #: NP_000086. 1 

1 11 21 31 41 51 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180 
INECETGQHN CVPNSVCINT RGSFOCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGD ACDN CRSQKNDDQK 360 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
RDNCRLVPNP GQED ADRDG V GD VCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAOIDPN WVVLNQGRE1 VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 
FGYODSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 
RLGVFCFSQE NUWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: 1 1 9 DN A sequence 

Nucleic Acid Accession #: NM_0142U 

Coding sequence: 1 57-1 479 (underlined sequences correspond to start and stop codons) 
I II 21 31 41 51 

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAG AGCC TCAACAACTA CCTGGTGATT 120 
CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 180 
TTCGTGTGTC TG AGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
G AGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCCA A TTTTGGTGGA G AACCCGTAC AGATAGCGCT G ACTCTGG AC 360 
ATTGCAAGTA TCTCTAGCAT TTCAG AG AGT AACATGG ACT ACACAGCCAC CATATACCTC 420 
CGACAGCGCT GG ATGGACCA GCGGCTGGTG TTTG AAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGG AGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
CTGTATGCCC TCAGAATCAC G ACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG A^TGATCTG 720 
GAGTTCACCT GGCTGAG AGG GAACG ACTCT GTGCGTGGAC TGG AACACCT GCGGCTTGCT 780 
CAGTACACCA TAGAGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGG A GACAGGAAAT 840 
TACACTAG AT TGGTCTTACA GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACGACCG TGTTATCAAT GACCACACTG 1020 
ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCQATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACA ACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 
GACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1380 
CAAAACCCCA GTAATGTTG A TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTGAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTC AACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTG ACTG AAATAATATT TGAGTCTTTC TGCTCAAAG A 1620 
ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAG AAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG G AGCCCAAG A 1740 
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1800 
AAG AATGGGA AGGAG ACCAT TGGGTAACCC TCAAGTGTCA /3AAGTTOTTT 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 
GAGTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATGAAA TATGGGCTTA TGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA GATGAGTTTT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTG AAAA TTCAAATATA 2100 
ACTGA^GAAA AAATTA^ TAAACAAATC CCTCGGTTCT AAG ATACAAT 2160 

G^ATTCO^^T/i^TGGvSjG ACTCTGAGGCTTTATTCCCCCA 22^0 
TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 
G^^VTGG AA G ATTTTTTTG TAACTTGTTC TAGAAGTCTT A ATATGGGCT GTTGCCATGA 2340 

agg^gc^a^^ 2400 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGG AATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
. GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 
5 TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTG AGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 

. _ TTAA'1' 1 l 'C I T AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 

1 0 ACCAAGGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AG AGCCACAG GTTCTCATTC 3000 
CTTTTCCCAT TATACTTCTC AC AATTCAGT TTCTATG AGT TTG ATCACCT G ATTTTTTTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 

, e AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 

15 AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 1 20 Protein sequence: 
Protein Accession Ui NP_055026. 1 

20 1 11 21 31 41 51 
I I I I I I 

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
YIVESKKSFL HEVTVGNRLI RLFSNGTVLY AtRITTTVAC NMDLSKYPMD TQTCKLQLES 180 
25 WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TTERYFTLVT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 
CFDCAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNUNSSIS 360 
SFKRK1SFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420 
LFPLDFMLAN VFYWAYYMYF 

30 

Seq ID NO: 121 DNA sequence 

Nucleic Acid Accession #: NMO01 854 

Coding sequence: 1 63-5582 (underlined sequences correspond to start and stop codons) 

35 1 11 21 31 41 51 

I I I I I I . 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT G ACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTG AGTGC GGGGCTCGG A GATGG AGCCG TGGTCCTCTA 1 80 

40 GGTGGAAAAC G AAACGGTGG CTCTGGG ATT TCACCOTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AG ATACTGCT TACAG AGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 

45 CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGG AG A TC ACCTGTTT TTCTGTTTGA AG ACCACACT GGAAAACCTG 600 
CCCCAGAAG A CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CGAAACCACT TGATAGAAGT G AG AG AGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 

5 0 G AACAAGG AT TTTGGATG AA G AAGTTTTTG AGGGGG ACAT TCAGCAGTTT TTG ATCACAG 840 
GTG ATCCCAA GGCAGCATAT G ACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAG ATG A GTATGCACCA GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAO AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 

55 AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1140 
GGACAAATG A GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAG AGGAAA AATTCTGAGG ATACACTATA TOAAA ACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 

- . AAG AATATG A AG ATAAACCA ACAAGCCCCC CTAATGAACA ATTTGGTCCA GGTGTACCAG 1380 

60 CAGAAACf GA TATTACAG AA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACT ATG TTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 

65 CAACCATCTC TGCTCAGG AA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1 740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTQ 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC I860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAG AA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

70 GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 

75 CAATTGGTCC TCCTGGTG AA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGG A AAG AAGGCCA GTCTGG AGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAQ 2460 
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CAGATGGTGT CAOAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAG A TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGG AA AAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
5 G AAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TOCCCAGG AC 3000 

1 0 ACCCTGGGCA ACGTGGGG AG ACTGGATTTC AAGGCAAG AC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3 1 80 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGG ACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 

1 5 AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGG AGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGG AG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

20 GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 
ATGGTG AACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATG GG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTCGTT 3900 

25 CAGTTGGTGC TGTTGGAGAA AAGGGTGAAC CTGGAG AAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGG AA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TG ATGATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 
GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGG AGA TCCTGGTCAA CCGGGTCCTC 4200 

30 CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACOAGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAG AAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGG ACC TGCAGG AAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGO TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

3 5 GCTCCAAGGG TG AAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGO ATCTCCAGGA GCAAAAGGGO 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGG ACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC : 4800 

40 CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG GATGG AATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA G ACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 504O 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

45 CTTGCATTTA TCCAGACAAA AAATCTGAGG G AGTAAG AAT TTCATCATGG CCAAAGGAGA 5160 
AACCAGGAAG TTGGTTTAGT G AATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT C ACCTACCAC TGTCATCAGT CAGCAGCCTG GTATG ATGTG TCATCAGGAA 5340 
GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

50 ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGJtj 5520 
ACTTTGGTGA TCAGAATCAG AAGTTCGG AT TTGAAGTTGG TCCTGTTTGT JTTCTTGGCX 5580 
AAG ATT AAG A CAAAG AACAT ATCAAATCAA CAGAAAATGT ACCTTGCTGC CACC^CCCA 56W 
TTTTGTGCCA CATGCAAGTT TTG AATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 

55 TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CAC^GACAAA AGCTTJGAAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5O0 
TTCTCAACTC TCCTTTTCCT ATTTG AATTT CTTTGGTGCT GTAGAAAACA AAA^GAAA ,5880 
AATATATATT CATAAAAA AT ATGGTGCTCA TTCTCATCCA T<XAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTrcATTT 6000 

60 CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGG AT GCCGAACTCT 6060 
GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATG ATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 122 Protein sequence: 
65 Protein Accession #: NPJ0OI84S 

1 U 21 31 41 51 

70 MEPWSSRWKT KRWLWDFTVT TLALTFEFQA REVRGAAPVD VLKALDFWS PEGISKTOF 60 
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS IUTVKPKKG J^i^i^E^. / , Qn 
EHGIOQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMTV 180 
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFUTGDPK AATOYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV T^TIAQTEA JOO 

75 NTVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEE1FTE EYLTGEDYDS QRKNSEDTLY 360 
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD rn^£TGHG 420 
AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 
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GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARJALRGP PGPMGLTGRP 540 
GPVGGPGSSO AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPCPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PROVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQG1SGKD 1020 
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1 140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDKGEDQD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 
GLKGDPGSKG EKGHPGUGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSLKQD1 EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 
EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 
LSYLDVEGNS INMVQMTFUC LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPF1 KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 

Seq ID NO. 123 DNA sequence 

Nucleic Acid Accession #: NMJ)1 5886 

Coding sequence: 485-1 261 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAG AC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAOGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGA AAA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 1 80 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTOCTCTTA GGGGTCTGCC TCATG AC ACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AGA AAGTC AA ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTCATA CCCTCTOGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGC A GCTCTG AAAG CACAATTAG A TTCAGCGG AT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAG AATG ACATGATCGC CATTCTTG AT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGG A ATATATGGTT TGGG ATGAAA ATCTTGCAA A 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATG AAG TGAAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 140 
AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 

Scq ID NO: 124 Protein sequence: 
Protein Accession U: NP_056970.1 

1 11 21 31 41 51 
I I I 1 I I 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTD1 EAALKAQLDS ADEPKARRKR 60 
Y1SQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 1 80 
SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 



Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_0O1793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| I I I i I 

GOGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCAJGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT G ACCTTGG AG GCGGG AGGCG 1 80 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
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CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAO ACAGTCCAGO 300 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTG AAAAT GGC AAGGGTC 420 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGG ACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAG AATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTG AC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGG ACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAG ATGAGG 780 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
GCCA7GAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAG AGCAA CCAGGGCATC CTGACAACCA GG AAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TITGTCCCAC 1380 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTG ACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 
ACCATGGCCC AGTCCCTG AG CCCCGTC AG A TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAGAGCAG CTGACGGTG A TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1 980 
ATGTCGAAAC CTGCCCTGG A CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG G ACCAGGACT ATG ACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCG ACACCCA 2280 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAG ATTA CGATTATCTG A ACG AGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGG ACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGG AGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGG AATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAG AGGCCAA GTTTCCAGAA GCCTCTTACC TG CCGT AAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAG A GCTGCTGGGC CCA CTGGC CG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GG ATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC C CTGTTGCQT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAG AA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP_001784 

1 11 21 31 41 51 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPUCIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF T1HRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYUMG GDDGDHFT1T THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKX 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTUD 540 
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEWL RNDVAPTUP TPMYRPRPAN PDEIGNFUE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSS AS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: NM_003256.1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 
1 U 21 31 41 51 

CCTGCTGGGG CCGTCCAGTC CCCCAG ACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
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TGCCTGGG AO CCCTCGGCCC GCGCC AAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 1 20 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 1 80 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
TCAAAGGGTT TGAGAAAGTC AAGG ATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTCGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTG ACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA G AGGGAA AGT CTGAATCATC ACTAOCATCT GAACTGTGGC TGCCAA ATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 
GGCTGTTGG A ACGAAAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATG AAGCATG 660 
TTGACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAG CTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCQ ATC TGGTAGQGGA GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1 140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

Sco ID NO: 128 Protein sequence: 
Protein Accession #: NPJW3247.1 

1 11 21 31 41 51 

111.) 1 I 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALV1RA KISSEKWPA 60 
SADPADTEKM LRYEIKQIKM FKGFEKVKDV QYIYTPFDSS LCGVK LEANS QKQYLLTGQV 1 20 
LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAPNECLWTD 180 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

Seq ID NO 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1 591 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGO AGCGCCA G ATGGTGG AG G AATACACTT ATTTATG AAA CTGTCTTG AG TTCTTCTTGA 1 20 
ATTGCCAGTT TTCAGCCTCC TC ATG CCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 1 80 
ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 
TGCTAACCAG ATGGTCAACA ATAATG AG AA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGG A CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATG A CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG GAGCTGTOCA 720 
CATTAACTGT GCCG ATAAG A TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 
AG ACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATGAG AATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAOAACCT CTGGTGTTGA AAGGTGOACT 960 
TAGTAGTTTT AAGCAG AACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 
CACCCCTOAC ATCGAG AACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1 140 
TOAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TG AG AAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAG AGG CTTTTGAGTT 1320 
CATTOAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 
CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTO ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTG TG AC AATGGTCT GQATGGAAA G GATTGCTGCT 1620 
CTCCATTAGG AGACAATGAG G AAGG AGG AT GGATTCTGGT TTTTTTTCTT TCI 1 1 II 111 1680 
TTGTAGTTGG G AGTAAAGTT TGTG AATGGA AACAAACTTG GTTAAACACT TTATTTTTAA 1740 
CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 
ATTAAGGAGG TTAA AGAAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAAT ACAACT 1 860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_009 138.1 

I 11 21 31 41 51 
I I I I I 1 

MPPSPLDDRV WALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPV IATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG TTTTAIGTST TCPANQMVNN 120 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKKMT KCSKSHLPSQ GPVUDCRPF 180 
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PCT/US02/02242 



MEYNKSHIQG AVHINCADKI SRRRLQQGKJ TVLDLISCRE GKDSFKR1FS KEUVYDENT 240 
NEPSRVMPSQ PLHIVLESUC REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VTNVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLUHCQA GVSRSATM 420 
5 AYU^KTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRI LTPKLMGVET 480 
W 

SeqIDKO:13I DNA sequence 
Nucleic Acid Accession #: NM.005409.3 
1 0 Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 

WcCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

1 5 CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCT ATAGCC 1 20 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTG ACA AA ATAG AAGTGA TTATTACCCT GAAAGAAAAT 300 
AAAGG ACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

20 GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATAtGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTG ACTACTG AAATGACAAG AATTCTACAG TACKWAAOT 1480 
AGACTTTTCT ATGGTTTTGT G ACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 
GGGTGAAAGG ACCAAAAACA G AAATACAGT CTTCCTGAAT GAATGACAAT CAG AATTCCA 600 
CTGCCCAAAG GAGTCCAGCA ATTAAATGG A TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

25 GGTTACCATC GGAGTTTACA AAGTGCTTTC ACX3TTCTTAC TTGTTGTATT ATACATTCAT 720 
GCATTTCTAG GCTAG AG AAC CTTCTAG ATT TGATGCTTAC AACTATTCTG TTGTG ACTAT 780 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCrTTAjGCT ATATTACTAT 840 
CTGTGGTTAC AGTGG AGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAQTCAAAAO 900 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAAT ACAC ACTTCTTTCC 960 

3 0 CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 
TTT^^ACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATG AAAAAAA TTAT^GCTA 1080 
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCGGTCTTAATTT 1 140 
GATGTTTrtC^AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

3 5 TACAAAATGT TTTTGTCTAC C AAAG AAAAA TGTTG AAAAA TAAGCAAATG TATACCTAGC 1320 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTAT ACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTA ATA A ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

40 *>q THNO; 132 Protein sequence: 

Protein Accession #: NP JJ05400. 1 

I 11 21 31 41 51 

45 MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVUTLKENK GQRCLNPKSK QARUIKKVE RKNF 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession ft: NM0I2342 
50 Coding sequence: 373-1 155 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 
55 GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCCGAC CCGGG GCTAG 120 

cStccc^ 

AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTOCGGC AG^GCCCGA 240 
AACCCAGCCC CGCCGCTG AC GG AGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGCGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG JI60 

60 TGCGGGGCGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA ^GCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAGA ACTCAAATTC CCCACTCACC CATGGCTGCC TGG ACTCTCT TGCAAGCACG 600 
AcStCTGCCAAGCCAA ACAGGCCCGA AACCACTCTG GC^CXA^T AC^ATTG 660 

65 G AATGCTGTC ATG A AG AC AT GTG C A ATT AC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 

ag^gSgcctcaggacaaggaaacaggtatcag^^^^ 

ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGTTCCGGGCAGC GGTCATTGCC 840 
gtgcccattg CTGG agggct gattttagtg TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CO^GAAA ATAAG AGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 
70 AGCTTTCACG GACACCATTC CAAAAAGGGG CAGGTTGCAA AGTTAGACTT GGAATGCATG 1020 

gSgtca^ 1080 

CTCAGCAACG ATAAGATCCT CTCGCTTGTT CACTGGGGCATGTACAGTGG GCACGGGAAG 1140 
CTGGAATTCG TATGACGG AG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTGAGTTCTG CTGIsaCAGGA GCACTTTATC TGAAGACAAA CTCATTTAAT CATCTTTGAG 1260 
75 AGACAAAATG ACCTCTGCAA ACAG AATCTT GGATATTTCT TCTGAAGGAT TATTTGCACA 1320 
GACTTAAATA CAGTTAA ATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTCTACACA CTGTC ACC AG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 

407 
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TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTG AT TTATTGTAAA OATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
S Protein Accession fl: NP_036474. 1 

I It 21 31 4! 51 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
10 SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLEOCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQKD GSRNLITKVQ ELTSSKELWF RAAVIAVPIA GGULVLUM LALRMLRSEN 180 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

15 Seq ID NO: 135 DNA sequence 

Nucleic Acid Accessions: NM_C0!627.1 

Coding sequence: 64-1 81 5 (underlined sequences correspond to start and stop coda ns) 
1 II 21 31 41 51 

20 i i | i i i 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTO GCCCACCAAG AAGGAGGAGG 60 
AATATGG AAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGG ATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 180 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG G AAATATGAA 240 

25 AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAOAA AAGTGTGCAG 300 
TACGACGATG TACCAG AATA CAAAG ACAG A TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AAOGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CG A A ACAG AG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 

3 0 G AAGACAGTT ATCCAG ATGG CA ATATCACA TGGTACAGG A ATGG AAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TQCTCGGTGA CATATTATGG ACC ATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AG AGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 

3 5 ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGG ACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA G ACTACAAGT GTTCCCTG AT AGACAAAAAA 1020 
AGCATG ATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAOACAG AT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1 140 

40 AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAG A GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAG A AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGG AAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 

45 GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATG ACCAGGC AAAACTAATT GTGGGAATCG TTGTTGOTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAG AC TGCATCAAAA 1740 

5 0 CATGTAAACA AGGACCTCGG TAATATGG A A G AAAACAAAA AOTTAGAAG A AAACAATCAC 1 800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC I860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGOTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 

55 ATAOGTCATA TGTTGTOTTT GGTTCAATTT TTTTTCCGTA AATGT CTGCA CTGAGQATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 2 1 60 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGOATAATC TGAGCAGGTA CATTTCTG AT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 

60 CCCTAATTAA AAACTGTGAT TTTTATCACA AGGG AGGGGA GGCCGAG AGT CAGACTGATA 2400 
GACACCATAG G AGCCGACTC TTTGATATGC CACCAGCG AA CTCT CAGA AA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

65 Seq ID NO: 136 Protein sequence: 

Protein Accession #: NP_00 1618.1 

1 11 21 31 41 51 

70 MESKGASSCR LLFCLLISAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYKDRL NLSENYTLSI SNAR1SDEKR FVCMLVTEDN 120 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLGDCISE DSYPDGN1TW YRNGKVLHPL 180 
EGAW0FKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTTHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 

75 MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATWWMKDN IRLRSSPSFS SLH YQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQDCM 420 
TKKTDPSGLS KTIICHVEGF PKPAIQWTIT GSGSVINQTE ESPYINGRYY SKUISPEEN 480 
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VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEISDENRE KVNDQAKUV GIVVGUXAA 540 
LVAGVVYWLY MKKSKTASKH VNKDLGNMEE NKKLEENNHK TEA 



Scq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM J)30559 

Coding sequence: 1 -119 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGA ATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC T GGAAA ATTT 120 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 180 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 
GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAA CGTA TTG 360 
CGTCCTG ACA ACCATAG AAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AG ACTTTAGA 420 
CCTGTGTCTT CTATTATAG A CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 
GTAACACCAC ATGGCTTAG A AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GGTCTGGCTC AAAGTACAGG ACTATT AGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAG AG CCTTGATCAA GTAACAG ACA TGATGATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 
GAACCAAGCT TTGAGCCAG A GGATGAAGAC AGCGAAGAAG ATGACATTAT CATTGAAG AC 900 
AATGG AGTGC CACAGCAG AT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GG AGTCATTA 960 
ACACAGATAG AGCTAAGCTT TG AGTCTOG A CAGAATGGCT TTATTCCCTC TAATG AAGTG 1020 
AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

Seq ID NO: 138 Protein sequence: 
Protein Accession #: XPJW0559 

1 II 21 31 41 51 

MNRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV 60 
GYADIHGDLL PINNDDNYHK A VSTANPLLR IFIQKKEEAD YSAFGTDTU KKKNVLTNVL 120 
RPDNHRKKPH IVISMPQDFR PVSSHDVDI LPETHRRVRL YKYGTEKPLG FYTCDGSSVR 180 
VTPHGLEKVP GIF1SRLVPG GLAQSTGIXA VNDEVLEVNG IEVSGKSLDQ VTDMMIANSR 240 
NUnVRPAN QRNNWRNSR TSGSSGQSTD NSLLG YPQQI EPSFEPEDED SEEDDIUED 300 
NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTU TL 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 LA method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1 , wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3 . The method of claim 2, wherein the nucleic acids are mRNA. 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-25. 

1 6. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1, wherein the patient is undergoing a therapeutic 

' 2 regimen to treat breast cancer. 

1 8. The method of claim 1, wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1-25. 

1 10. The nucleic acid molecule of claim 9, which is labeled. 

1 1 1. An expression vector comprising the nucleic acid of claim 9. 

1 1 2. A host cell comprising the expression vector of claim 1 1 . 
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1 1 3 . An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-25. 

1 14. An antibody that specifically binds a polypeptide of claim 13. 

1 15. The antibody of claim 14, further conjugated to an effector component. 

1 16. The antibody of claim 15, wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 1 5, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fragment. 

1 1 9. The antibody of claim 15, which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 21. The method of claim 20, wherein the antibody is further conjugated to 

2 an effector component. 

1 22. The method of claim 2 1 , wherein the effector component is a 

2 fluorescent label. 

1 23. A method for identifying a compound that modulates a breast cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 
1 24. A drug screening assay comprising the steps of 
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2 (i) administering a test compound to a mammal having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1 -25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 
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